Changelog for Peloton

0.9.0 (unreleased)

0.8.12

0.8.11

0.8.10

0.8.9.6

0.8.9.1

0.8.8

0.8.7

0.8.6

0.8.5

0.8.4

0.8.3.1

0.8.3

0.8.2.1

0.8.2

0.8.1

0.8.0

0.7.8.1

0.7.8

0.7.7.3

0.7.7.2

0.7.7.1

0.7.7

0.7.6

0.7.5.2

0.7.5.1

0.7.5

0.7.4

0.7.3

0.7.2

0.7.1.3

0.7.1.2

0.7.1.1

0.7.1

0.7.0

0.6.14

0.6.13

0.6.12

  • 2018-04-19 Hide admission of non-preemptible jobs behind a flag Anant Vyas
  • 2018-04-20 Checking mesos taskId before removing task from tracker Mayank Bansal
  • 2018-04-19 Enable Aurora health check for Peloton Tengfei Mu
  • 2018-04-19 Update changelog for 0.6.12 Tengfei Mu
  • 2018-04-19 Fixing race condition between removing task from tracker and adding the same task with different mesos task id Mayank Bansal
  • 2018-04-19 Update health.leader when candidate is not leader Zhixin Wen
  • 2018-04-19 Add Host APIs Sachin Sharma
  • 2018-04-18 Add comment for channel 'finished' in resmgr/recovery.go Sachin Sharma
  • 2018-04-18 fix unit test broken by revert of 4533a25 Zhixin Wen
  • 2018-04-18 eventstream client send correct purgeOffset upon restart Zhixin Wen
  • 2018-04-18 unset completion time when task is running Zhixin Wen
  • 2018-04-17 Revert "Revert "Add 100k task per job limit to master code"" Aditya Bhave
  • 2018-04-17 Retry Do not recover FAILED jobs till archiver is committed Tengfei Mu
  • 2018-04-17 Revert "Rearchitect the job manager to use the cache and the goal state engine" Tengfei Mu
  • 2018-04-17 Revert "Do not recover FAILED jobs till archiver is committed." Tengfei Mu
  • 2018-04-17 Revert "Add 100k task per job limit to master code" Tengfei Mu
  • 2018-04-17 Revert "Fix completion time for jobs moving from PENDING to KILLED" Tengfei Mu
  • 2018-04-16 Fix completion time for jobs moving from PENDING to KILLED Aditya Bhave
  • 2018-04-16 Add max_retry_attempts for test__create_job to pass smoketest Chunyang Shen
  • 2018-04-12 Add 100k task per job limit to master code Aditya Bhave
  • 2018-04-13 enable host tags for metrics Zhixin Wen
  • 2018-04-10 Bump up C* timeouts and add timers to recovery code Aditya Bhave
  • 2018-04-12 Add Host Maintenance API Sachin Sharma
  • 2018-04-11 Change GC and compaction for tables with large partitions Aditya Bhave
  • 2018-04-10 Adding errorcodes in communication between resmgr and jobmgr for enqueuegangs Mayank Bansal
  • 2018-04-10 fix potential memory leak in priorityQueue Zhixin Wen
  • 2018-03-28 Make preemptor aware of non-preemptible tasks Anant Vyas
  • 2018-03-26 Admission control for non-preemptible gangs Anant Vyas
  • 2018-04-09 remove unused api.ResultSet to pass lint Zhixin Wen
  • 2018-04-09 Reconcile Staging Tasks Varun Gupta
  • 2018-04-05 Add script to do performance comparison between two versions Chunyang Shen
  • 2018-04-04 Push to registry docker-registry02-sjc1:5055 Chunyang Shen
  • 2018-04-04 Do not recover FAILED jobs till archiver is committed. Apoorva Jindal
  • 2018-04-03 Fix docker build script and update ATG registry Chunyang Shen
  • 2018-03-22 Rearchitect the job manager to use the cache and the goal state engine Apoorva Jindal
  • 2018-04-02 Add a log when transient DB error occur on the hostmgr eventstream path Apoorva Jindal
  • 2018-03-29 Fix resmgr reason for state transition Apoorva Jindal
  • 2018-04-02 Update Glide installation in Makefile Chunyang Shen
  • 2018-03-26 Don't log UUID in sentry error Anant Vyas
  • 2018-03-04 Add a common library to implement a goal state engine Apoorva Jindal
  • 2018-03-23 Rename metric tag from type to result for success/fail Charles Raimbert
  • 2018-03-22 Delete job_index entry as part of DeleteJob Aditya Bhave
  • 2018-03-20 Address remaining review comments on in-memory DB Apoorva Jindal

0.6.11

  • 2018-03-21 Pin down YARPC version in glide to avoid uber fx Charles Raimbert
  • 2018-03-21 Use patched docker/libkv for ZooKeeper Leader Election Charles Raimbert
  • 2018-03-21 Use long running job fixture for test__stop_long_running_batch_job_immediately Anant Vyas
  • 2018-03-20 Modify GetTasksForJobAndStates to accept []TaskState parameter instead of []string Sachin Sharma
  • 2018-03-15 Add integration test for Job Query API Aditya Bhave
  • 2018-03-16 Do not update the state transition reason on dequeue from placement engine Apoorva Jindal
  • 2018-03-15 Correct scheduled task accounting in case of launch errors for maxRunningInstance feature Apoorva Jindal
  • 2018-03-04 Add cache to job manager. Apoorva Jindal
  • 2018-03-19 Adding support for static respool in Tree hierarchy and Entitlement Mayank Bansal
  • 2018-03-12 Add support to query jobs by timerange Aditya Bhave
  • 2018-03-08 Be able to teardown vcluster in any fail in launching or testing vcluster Chunyang Shen
  • 2018-03-14 Add runtime info to jobquery cli output Aditya Bhave
  • 2018-03-14 Always evaluate a job for maxRunningInstances SLA irrespective of job runtime updater result Apoorva Jindal
  • 2018-03-13 Adding Static reservation type into resourcepool config Mayank Bansal
  • 2018-03-08 Add integration tests for controller task Anant Vyas
  • 2018-03-06 Add a monitor job for vcluster to send data to M3 Chunyang Shen
  • 2018-03-08 Enable integration test for fetching logs of previous task runs of failed task Apoorva Jindal
  • 2018-03-09 Dividing entitlement calculation to phases and adding more tests to entitlement Mayank Bansal
  • 2018-03-07 Do not overwrite killed state for partially completed jobs Apoorva Jindal
  • 2018-03-08 Add 'task query' command to CLI to query on tasks(for a job) by state(s) Sachin Sharma
  • 2018-03-07 Fix race condition in state machine rollback Anant Vyas

0.6.10.5

0.6.10.4

  • 2018-03-06 Remove 7 day time span restriction from querying active jobs [email protected]

0.6.10.3

0.6.10.2

0.6.10.1

  • 2018-02-28 Revert "Add 'task query' command to CLI to query on tasks(for a job) by state(s)" [email protected]

0.6.10

0.6.9

0.6.8.2

  • 2018-02-06 Untrack failed tasks with goal state succeeded. Apoorva Jindal

0.6.8.1

  • 2018-02-03 Fix migrate script for job_index Aditya Bhave

0.6.8

  • 2018-02-02 Removing race between different transitions in state machine Mayank Bansal
  • 2018-02-02 Adding mesos quota support in cluster capacity call for host manager Mayank Bansal
  • 2018-01-31 Schema and DB change to speed up JobQuery Aditya Bhave
  • 2018-02-02 Adding Limit support for resource pools Mayank Bansal
  • 2018-01-31 Adding apidoc in docs folder from build Mayank Bansal
  • 2018-01-31 Adding peloton engdocs Mayank Bansal
  • 2018-01-02 Add extra logging in state machine implementation Anant Vyas
  • 2018-01-31 Changing api docs to html format Mayank Bansal
  • 2018-01-25 Ignore failure event due to duplicate task ID message from Mesos Apoorva Jindal
  • 2018-01-26 Send kill of PENDING tasks to resource manager Apoorva Jindal
  • 2018-01-24 Send initialized tasks during recovery as a batch to resource manager Apoorva Jindal
  • 2018-01-24 Guard against any case when hostname may be missing in offer pool. Zhitao Li
  • 2018-01-22 Add Script for performance test running Chunyang Shen
  • 2018-01-11 Fix sorting based on creation/completion time in job query Apoorva Jindal
  • 2018-01-24 Do not run job action with a context timeout. Apoorva Jindal
  • 2018-01-23 Revert "Temporarily, do not recover initialized tasks in non-initialized jobs in job manager" Apoorva Jindal
  • 2018-01-08 shutdown executor after task kill timeout Chunyang Shen

0.6.7

0.6.6

0.6.5

0.6.4

  • 2017-12-14 Skip terminal jobs during job manager sync from DB @apoorvaj
  • 2017-12-14 Added the mesos podtask @pourchet

0.6.3

  • 2017-12-14 Increase MaxRecvMsgSize in gRPC to 256MB @min
  • 2017-12-14 Merge the placement engine from the master branch into release @kejlberg
  • 2017-12-13 Move metrics gauge update to asynchronous @zhitao
  • 2017-12-13 Update volume state upon stateful task running status update @mu
  • 2017-12-13 Add more logging for jobmgr to launch stateful @mu
  • 2017-12-13 Fixing Integration test preprod cluster zk address @mabansal
  • 2017-12-13 Add reservation cleaner to clean both unused volume and resources @mu
  • 2017-12-13 Add job goal state to job manager @apoorvaj
  • 2017-12-13 Add materialized view for volume by state @mu

0.6.2

  • 2017-12-12 Adding more logging to entitlement calculator in resmgr @Mayank Bansal
  • 2017-12-12 Revert "Check in mocks" @Antoine Pourchet
  • 2017-12-12 Adding deadline feature in Peloton @Mayank Bansal
  • 2017-12-08 Add changelog for changes between 0.5.0 and 0.6.0 @Anant Vyas

0.6.1

  • 2017-12-08 Improve Resource Manager recovery performance @Anant Vyas
  • 2017-12-06 Add materialized view for volumes by job ids @Tengfei Mu
  • 2017-12-07 Update task runtime state when receiving a mesos kill event @Apoorva Jindal
  • 2017-12-06 Do not update runtime reason on mesos update always @Apoorva Jindal
  • 2017-12-06 Move volumesvc from hostmgr to jobmgr @Tengfei Mu
  • 2017-12-07 Check in mocks @Tengfei Mu
  • 2017-12-05 Kill orphaned tasks in mesos @Apoorva Jindal
  • 2017-12-05 Implement volume list and delete API @Tengfei Mu
  • 2017-12-04 Add reason and message for every update to task runtime @Apoorva Jindal
  • 2017-12-04 Return failed instance list in task stop and task start @Apoorva Jindal
  • 2017-12-01 Handle task start of failed tasks @Apoorva Jindal
  • 2017-11-28 Restart the goal state when placement received for a task which needs to be killed. @Apoorva Jindal
  • 2017-11-29 Handle stopped tasks during reconciliation. @Apoorva Jindal
  • 2017-12-01 Add yaml files for performance tests @Apoorva Jindal
  • 2017-11-30 Remove smoketest tag from preemption integ test @Anant Vyas
  • 2017-11-21 Porting storage changes from master to release @Apoorva Jindal