Recent Releases of cortex
cortex - v0.42.1
v0.42.1
New features
- Add support for a new set of EC2 instances, including the c6 and g5 families https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
Bug fixes
- Cosmetic fix: the VPC CNI logging functionality no longer triggers warn logs when running the cortex CLI https://github.com/cortexlabs/cortex/pull/2443 (RobertLucian)
Misc
- Update Cortex dependency versions (eksctl, EKS to 1.22, AWS IAM, Python, etc.) https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian, deliahu)
Published by RobertLucian over 3 years ago
cortex - v0.42.0
v0.42.0
New features
- Add support for the Classic Load Balancer for APIs; the Network Load Balancer remains the default (docs) https://github.com/cortexlabs/cortex/pull/2413 https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
Bug fixes
- Fix Async API http/tcp probes when probing the empty root path (/) https://github.com/cortexlabs/cortex/pull/2407 (RobertLucian)
- Fix nil pointer exception in the cortex cluster export command https://github.com/cortexlabs/cortex/pull/2415 https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
- Ensure that user-specified environment variables are ordered deterministically in the Kubernetes deployment spec https://github.com/cortexlabs/cortex/pull/2411 (deliahu)
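Why ordering matters for the environment variable fix above: a Kubernetes pod template stores environment variables as an ordered list, so emitting the same variables in a different order produces a different spec and can look like a change that needs a rollout. The following is a minimal Python sketch of the idea only (the actual fix lives in Cortex's own codebase and is not reproduced here; the function name `ordered_env` is hypothetical).

```python
def ordered_env(env):
    """Return env vars as a list of {"name", "value"} entries sorted by name.

    Hypothetical helper: sorting by key makes the output independent of the
    order in which the user's mapping happened to be built.
    """
    return [{"name": name, "value": env[name]} for name in sorted(env)]


if __name__ == "__main__":
    a = ordered_env({"MODEL": "bert", "BATCH_SIZE": "8"})
    b = ordered_env({"BATCH_SIZE": "8", "MODEL": "bert"})
    print(a == b)
```

Two mappings with the same contents yield identical lists, so the generated spec stays stable across deploys.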
Misc
- Ensure that the batch on-job-complete request contains a valid JSON body https://github.com/cortexlabs/cortex/pull/2409 (RobertLucian)
Published by deliahu about 4 years ago
cortex - v0.41.0
v0.41.0
New features
- Support configurable pre_stop command for containers https://github.com/cortexlabs/cortex/pull/2403 (docs) (deliahu)
Misc
- Support m6i instance types https://github.com/cortexlabs/cortex/pull/2398 (deliahu)
- Update to Kubernetes v1.21 https://github.com/cortexlabs/cortex/pull/2398 (deliahu)
Bug fixes
- Wait for in-flight requests to reach zero before terminating the proxy container https://github.com/cortexlabs/cortex/pull/2402 (deliahu)
- Fix cortex get --env command https://github.com/cortexlabs/cortex/pull/2404 (deliahu)
- Fix cluster price estimate during cortex cluster up for spot node groups with on-demand base capacity https://github.com/cortexlabs/cortex/pull/2406 (RobertLucian)
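The "wait for in-flight requests to reach zero" fix above describes a common graceful-shutdown pattern: count requests as they start and finish, and block termination until the count drops to zero or a deadline passes. Below is a small Python sketch of that pattern under our own assumptions. The class name `InFlightTracker` and its methods are hypothetical, and the proxy's real implementation is not shown in these notes.

```python
import threading


class InFlightTracker:
    """Counts active requests and lets a shutdown hook wait for them to drain."""

    def __init__(self):
        self._count = 0
        self._cond = threading.Condition()

    def __enter__(self):
        # A request has started.
        with self._cond:
            self._count += 1
        return self

    def __exit__(self, exc_type, exc, tb):
        # A request has finished; wake any waiter once the count hits zero.
        with self._cond:
            self._count -= 1
            if self._count == 0:
                self._cond.notify_all()
        return False

    def wait_until_idle(self, timeout=None):
        """Block until no requests are in flight; return False on timeout."""
        with self._cond:
            return self._cond.wait_for(lambda: self._count == 0, timeout=timeout)
```

A request handler would wrap its work in `with tracker:`, and the termination hook would call `tracker.wait_until_idle(timeout=...)` before exiting.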
Nucleus Model Server
We have released v0.1.0 of the Nucleus model server!
Nucleus is a model server for TensorFlow and generic Python models. It is compatible with Cortex clusters, Kubernetes clusters, and any other container-based deployment platforms. Nucleus can also be run locally via Docker compose.
Some of Nucleus's features include:
- Generic Python models (PyTorch, ONNX, Sklearn, MLFlow, Numpy, Pandas, etc)
- TensorFlow models
- CPU and GPU support
- Serve models directly from S3 paths
- Configurable multiprocessing and multithreading
- Multi-model endpoints
- Dynamic server-side request batching
- Automatic model reloading when new model versions are uploaded to S3
- Model caching based on LRU policy (on disk and memory)
- HTTP and gRPC support
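To illustrate the "model caching based on LRU policy" item in the list above, here is a minimal in-memory sketch in Python. It is not Nucleus's implementation (which also covers on-disk caching and is not shown in these notes); the class name `LRUModelCache`, the `loader` callback, and the capacity value are all assumptions made for the example.

```python
from collections import OrderedDict


class LRUModelCache:
    """Keeps at most `capacity` models in memory, evicting the least recently used."""

    def __init__(self, capacity, loader):
        self._capacity = capacity
        self._loader = loader          # hypothetical callback: model name -> model object
        self._models = OrderedDict()   # insertion order doubles as recency order

    def get(self, name):
        if name in self._models:
            # Cache hit: mark the model as most recently used.
            self._models.move_to_end(name)
            return self._models[name]
        # Cache miss: load the model, then evict the oldest entry if over capacity.
        model = self._loader(name)
        self._models[name] = model
        if len(self._models) > self._capacity:
            self._models.popitem(last=False)
        return model

    def cached(self):
        """Model names currently held, from least to most recently used."""
        return list(self._models)
```

With a capacity of 2, requesting models a, b, a, c in that order leaves a and c cached, because b was the least recently used entry when c arrived.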
Published by deliahu about 4 years ago
cortex - v0.40.0
v0.40.0
New features
- Support concurrency for Async APIs (via the max_concurrency field) https://github.com/cortexlabs/cortex/pull/2376 https://github.com/cortexlabs/cortex/issues/2200 (miguelvr)
- Add graphs for cluster-wide and per-API cost breakdowns to the cluster metrics dashboard https://github.com/cortexlabs/cortex/pull/2382 https://github.com/cortexlabs/cortex/issues/1962 (RobertLucian)
- Allow worker nodes containing Async APIs to scale to zero (now a shared async gateway is used, which runs on the operator node group) https://github.com/cortexlabs/cortex/pull/2380 https://github.com/cortexlabs/cortex/issues/2279 (vishalbollu)
- Add cortex describe API_NAME command for Realtime and Async APIs https://github.com/cortexlabs/cortex/pull/2368 https://github.com/cortexlabs/cortex/issues/2320 https://github.com/cortexlabs/cortex/issues/2359 (RobertLucian)
- Support updating the priority of an existing node group https://github.com/cortexlabs/cortex/pull/2369 https://github.com/cortexlabs/cortex/issues/2254 (vishalbollu)
Misc
- Improve the reporting of API statuses https://github.com/cortexlabs/cortex/pull/2368 https://github.com/cortexlabs/cortex/issues/2320 https://github.com/cortexlabs/cortex/issues/2359 (RobertLucian)
- Remove the default readiness probe on the target port if a custom readiness probe is specified in the API spec https://github.com/cortexlabs/cortex/pull/2379 (RobertLucian)
Published by deliahu over 4 years ago
cortex - v0.39.1
v0.39.1
Bug fixes
- Remove an unnecessary cluster validation which limited the IP ranges that could be used in api_load_balancer_cidr_white_list and operator_load_balancer_cidr_white_list https://github.com/cortexlabs/cortex/pull/2363 (RobertLucian)
Published by deliahu over 4 years ago
cortex - v0.39.0
v0.39.0
New features
- Add cortex cluster health command to show the health of the cluster's components https://github.com/cortexlabs/cortex/pull/2313 https://github.com/cortexlabs/cortex/issues/2029 (miguelvr)
- Forward request headers to AsyncAPIs https://github.com/cortexlabs/cortex/pull/2329 https://github.com/cortexlabs/cortex/issues/2296 (miguelvr)
- Add metrics dashboard for Task APIs https://github.com/cortexlabs/cortex/pull/2311 https://github.com/cortexlabs/cortex/pull/2322 (RobertLucian)
Reliability
- Enable larger cluster sizes (up to 1000 nodes with 10000 pods) by enabling IPVS https://github.com/cortexlabs/cortex/pull/2357 https://github.com/cortexlabs/cortex/issues/1834 (RobertLucian)
- Automatically limit the rate at which nodes are added to avoid overloading the Kubernetes API server https://github.com/cortexlabs/cortex/pull/2331 https://github.com/cortexlabs/cortex/pull/2338 https://github.com/cortexlabs/cortex/issues/2314 (RobertLucian)
- Ensure cluster autoscaler availability https://github.com/cortexlabs/cortex/pull/2347 https://github.com/cortexlabs/cortex/issues/2346 (RobertLucian)
- Improve istiod availability at large scale https://github.com/cortexlabs/cortex/pull/2342 https://github.com/cortexlabs/cortex/issues/2332 (RobertLucian)
- Reduce metrics shown in cortex get to improve scalability and reliability of the command https://github.com/cortexlabs/cortex/pull/2333 https://github.com/cortexlabs/cortex/issues/2319 (vishalbollu)
- Show aggregated node statistics in the cluster dashboard https://github.com/cortexlabs/cortex/pull/2336 https://github.com/cortexlabs/cortex/issues/2318 (RobertLucian)
Bug fixes
- Ensure that the Content-Type header is properly set to application/json for responses to Async API submissions https://github.com/cortexlabs/cortex/pull/2323 (vishalbollu)
- Fix pod autoscaler scale-to-zero edge cases https://github.com/cortexlabs/cortex/pull/2350 (miguelvr)
- Allow autoscaling configuration to be updated on a running API https://github.com/cortexlabs/cortex/pull/2355 (RobertLucian)
- Fix node group priority calculation for the cluster autoscaler https://github.com/cortexlabs/cortex/pull/2358 https://github.com/cortexlabs/cortex/pull/2343 (RobertLucian, deliahu)
- Allow the node_groups selector to be updated in a running API https://github.com/cortexlabs/cortex/pull/2354 (RobertLucian)
- Fix the active replicas graph on the Async API dashboard https://github.com/cortexlabs/cortex/pull/2328 (RobertLucian)
Docs
- Add a guide for running in production https://github.com/cortexlabs/cortex/pull/2334 https://github.com/cortexlabs/cortex/issues/2317 (vishalbollu)
- Add a guide for configuring an HTTP API Gateway https://github.com/cortexlabs/cortex/pull/2341 (deliahu)
Misc
- Add a graph of the number of active and queued requests to the Async API dashboard https://github.com/cortexlabs/cortex/pull/2326 https://github.com/cortexlabs/cortex/issues/1960 (deliahu)
- Add a graph of the number of instances to the cluster dashboard https://github.com/cortexlabs/cortex/pull/2336 https://github.com/cortexlabs/cortex/issues/2318 (RobertLucian)
- Ensure that cortex cluster info --print-config displays YAML that is consumable by cortex cluster configure https://github.com/cortexlabs/cortex/pull/2324 (vishalbollu)
Published by deliahu over 4 years ago
cortex - v0.38.0
v0.38.0
New features
- Support autoscaling down to zero replicas for Realtime APIs https://github.com/cortexlabs/cortex/pull/2298 https://github.com/cortexlabs/cortex/issues/445 (miguelvr)
- Allow ssl_certificate_arn, api_load_balancer_cidr_white_list, and operator_load_balancer_cidr_white_list to be updated on an existing cluster (via the cortex cluster configure command) https://github.com/cortexlabs/cortex/pull/2305 https://github.com/cortexlabs/cortex/issues/2107 (vishalbollu)
- Allow Prometheus's instance type to be configured (docs) https://github.com/cortexlabs/cortex/pull/2307 https://github.com/cortexlabs/cortex/issues/2285 (RobertLucian)
- Allow multiple Inferentia chips to be assigned to a single container https://github.com/cortexlabs/cortex/pull/2304 https://github.com/cortexlabs/cortex/issues/1123 (deliahu)
Bug fixes
- Fix cluster autoscaler's nodegroup priority calculation https://github.com/cortexlabs/cortex/pull/2309 (RobertLucian)
Misc
- Various scalability improvements https://github.com/cortexlabs/cortex/pull/2307 https://github.com/cortexlabs/cortex/pull/2304 https://github.com/cortexlabs/cortex/issues/2297 https://github.com/cortexlabs/cortex/issues/2278 https://github.com/cortexlabs/cortex/issues/2285
- Allow setting a nodegroup's max_instances to 0 https://github.com/cortexlabs/cortex/pull/2310 (RobertLucian)
Published by deliahu over 4 years ago
cortex - v0.37.0
v0.37.0
New features
- Support ARM instance types https://github.com/cortexlabs/cortex/pull/2268 https://github.com/cortexlabs/cortex/issues/1528 (RobertLucian)
- Add cortex cluster configure command to add, remove, or scale nodegroups on a running cluster https://github.com/cortexlabs/cortex/pull/2246 https://github.com/cortexlabs/cortex/issues/2096 (RobertLucian)
- Add cortex cluster info --print-config command to print the current configuration of a running cluster https://github.com/cortexlabs/cortex/pull/2246 (RobertLucian)
- Add metrics dashboard for Async APIs https://github.com/cortexlabs/cortex/pull/2242 https://github.com/cortexlabs/cortex/issues/1958 (miguelvr)
- Support cortex refresh command for Async APIs https://github.com/cortexlabs/cortex/pull/2265 https://github.com/cortexlabs/cortex/issues/2237 (deliahu)
Breaking changes
- The cortex cluster scale command has been replaced by the cortex cluster configure command.
Bug fixes
- Fix Async API metrics reporting for non-200 response status codes https://github.com/cortexlabs/cortex/pull/2266 (miguelvr)
- Make batch job metrics persistence resilient to instance termination https://github.com/cortexlabs/cortex/pull/2247 https://github.com/cortexlabs/cortex/issues/2041 (vishalbollu)
- Make network validations during cortex cluster up more permissive (to avoid unnecessarily failing checks on GovCloud) https://github.com/cortexlabs/cortex/pull/2248 (vishalbollu)
- Fix Inferentia resource requests https://github.com/cortexlabs/cortex/pull/2250 (RobertLucian)
Docs
- Add instructions for exporting logs and metrics to external tools (vishalbollu)
Misc
- Improve output of cortex cluster info for running batch jobs https://github.com/cortexlabs/cortex/pull/2270 (deliahu)
- Persist Batch job metrics regardless of job status https://github.com/cortexlabs/cortex/pull/2244 (miguelvr)
- Support creating clusters with no node groups https://github.com/cortexlabs/cortex/pull/2269 (deliahu)
- Improve handling of container startup errors in batch jobs with multiple containers https://github.com/cortexlabs/cortex/pull/2260 https://github.com/cortexlabs/cortex/issues/2217 (vishalbollu)
- Add CPU and memory resource requests to the proxy and dequeuer containers https://github.com/cortexlabs/cortex/pull/2252 (deliahu)
Published by deliahu over 4 years ago
cortex - v0.36.0
v0.36.0
New features
- Support running arbitrary Docker containers in all workload types (Realtime, Async, Batch, Task) https://github.com/cortexlabs/cortex/pull/2173 (RobertLucian, miguelvr, vishalbollu, deliahu, ospillinger)
- Support autoscaling Async APIs to zero replicas https://github.com/cortexlabs/cortex/pull/2224 https://github.com/cortexlabs/cortex/issues/2199 (RobertLucian)
Breaking changes
- With this release, we have generalized Cortex to exclusively support running arbitrary Docker containers for all workload types (Realtime, Async, Batch, and Task). This enables the use of any model server, programming language, etc. As a result, the API configuration has been updated: the predictor section has been removed, the pod section has been added, and the autoscaling parameters have been modified slightly (depending on the workload type). See updated docs for Realtime, Async, Batch, and Task. If you'd like to see examples of Dockerizing Python applications, see our test/apis folder.
- The cortex prepare-debug command has been removed; Cortex now exclusively runs Docker containers, which can be run locally via docker run.
- The cortex patch command has been removed; its behavior is now identical to cortex deploy.
- The cortex logs command now prints a CloudWatch Insights URL with a pre-populated query which can be executed to show logs from your workloads, since this is the recommended approach in production. If you wish to stream logs from a pod at random, you can use cortex logs --random-pod (keep in mind that these logs will not include some system logs related to your workload).
- gRPC support has been temporarily removed; we are working on adding it back in v0.37.
Bug fixes
- Handle exception when initializing the Python client when the default environment is not set https://github.com/cortexlabs/cortex/pull/2225 https://github.com/cortexlabs/cortex/issues/2223 (deliahu)
Docs
- Document how to configure SMTP in Grafana (e.g. to enable email alerts) https://github.com/cortexlabs/cortex/pull/2219 (RobertLucian)
Misc
- Show CloudWatch Insights URL with a pre-populated query in the output of cortex logs https://github.com/cortexlabs/cortex/issues/2085 (vishalbollu)
- Improve efficiency of batch job submission validations https://github.com/cortexlabs/cortex/pull/2179 https://github.com/cortexlabs/cortex/issues/2178 (deliahu)
Published by deliahu over 4 years ago
cortex - v0.35.0
v0.35.0
New features
- Avoid processing HTTP requests that have been cancelled by the client https://github.com/cortexlabs/cortex/pull/2135 https://github.com/cortexlabs/cortex/issues/1453 (vishalbollu)
- Support GP3 volumes (and make GP3 the default volume type) https://github.com/cortexlabs/cortex/pull/2130 https://github.com/cortexlabs/cortex/issues/1843 (RobertLucian)
- Allow setting the shared memory (shm) size for Task APIs https://github.com/cortexlabs/cortex/pull/2132 https://github.com/cortexlabs/cortex/issues/2115 (RobertLucian)
- Implement automatic 7-day expiration for Async API responses https://github.com/cortexlabs/cortex/pull/2151 (RobertLucian)
- Add cortex env rename command https://github.com/cortexlabs/cortex/pull/2165 https://github.com/cortexlabs/cortex/issues/1773 (deliahu)
Breaking changes
- The Python client methods which deploy Python classes have been separated from the deploy() method. Now, deploy() is used only to deploy project folders, and deploy_realtime_api(), deploy_async_api(), deploy_batch_api(), and deploy_task_api() are for deploying Python classes. (docs)
- The name of the bucket that Cortex uses for internal purposes is no longer configurable. During cluster creation, Cortex will auto-generate the bucket name (and create the bucket if it doesn't exist). During cluster deletion, the bucket will be emptied (unless the --keep-aws-resources flag is provided to cortex cluster down). Users' files should not be stored in the Cortex internal bucket.
Bug fixes
- Fix the number of Async API replicas shown in cortex cluster info https://github.com/cortexlabs/cortex/pull/2140 https://github.com/cortexlabs/cortex/issues/2129 (RobertLucian)
Misc
- Delete all cortex-created AWS resources when deleting a cluster, and support the --keep-aws-resources flag with cortex cluster down to preserve AWS resources https://github.com/cortexlabs/cortex/pull/2161 https://github.com/cortexlabs/cortex/issues/1612 (RobertLucian)
- Validate the user's AWS service quota for number of security groups and in/out rules during cluster creation https://github.com/cortexlabs/cortex/pull/2127 https://github.com/cortexlabs/cortex/issues/2087 (RobertLucian)
- Allow specifying only one of --min-instances or --max-instances with cortex cluster scale https://github.com/cortexlabs/cortex/pull/2149 (RobertLucian)
- Use 405 status code for un-implemented Realtime API methods https://github.com/cortexlabs/cortex/pull/2158 (RobertLucian)
- Decrease file size and project size limits https://github.com/cortexlabs/cortex/pull/2152 (deliahu)
- Set the default environment name to the cluster name when creating a cluster https://github.com/cortexlabs/cortex/pull/2164 https://github.com/cortexlabs/cortex/issues/1546 (deliahu)
Published by deliahu almost 5 years ago
cortex - v0.34.0
v0.34.0
New features
- Support handling GET, PUT, PATCH, and DELETE HTTP requests in Realtime APIs (docs) https://github.com/cortexlabs/cortex/pull/2111 https://github.com/cortexlabs/cortex/issues/2063 (RobertLucian)
- Support running realtime API containers locally for debugging / development purposes (docs) https://github.com/cortexlabs/cortex/pull/2112 https://github.com/cortexlabs/cortex/issues/2077 (vishalbollu)
- Support multiple gRPC services / methods (which can be named arbitrarily) in a single Realtime API (docs) https://github.com/cortexlabs/cortex/pull/2111 https://github.com/cortexlabs/cortex/issues/2063 (RobertLucian)
- Support specifying a list of node groups on which a workload is allowed to run (see configuration docs for Realtime, Async, Batch, or Task APIs) https://github.com/cortexlabs/cortex/pull/2098 https://github.com/cortexlabs/cortex/issues/2034 (RobertLucian)
- Support AWS GovCloud regions https://github.com/cortexlabs/cortex/pull/2118 https://github.com/cortexlabs/cortex/issues/2103 (vishalbollu)
Breaking changes
- "predictor" has been renamed to "handler" throughout the product (API configuration and Python APIs). In addition, as a result of supporting additional HTTP method verbs, predict() has been renamed to handle_post() in Realtime APIs (handle_get(), handle_put(), handle_patch(), and handle_delete() are now also supported). For consistency, predict() has been renamed to handle_async() for Async APIs, and handle_batch() for Batch APIs. See the examples for Realtime, Async, and Batch APIs. Task APIs have not been changed.
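As a rough picture of the rename described above, the sketch below shows a Realtime-style handler class that uses the new method names, plus a toy dispatcher that maps an HTTP verb to the matching method. Only the method names handle_post() and handle_get() come from these notes; the constructor argument, the method parameters, and the `dispatch` helper are assumptions made for illustration and are not Cortex's actual interface. The 405 fallback mirrors the v0.35.0 note about un-implemented Realtime API methods.

```python
class Handler:
    """Illustrative handler using the post-rename method names."""

    def __init__(self, config=None):
        # Assumed constructor signature, for the example only.
        self.config = config or {}

    def handle_post(self, payload):
        # Previously this logic would have lived in predict().
        return {"echo": payload}

    def handle_get(self):
        return {"status": "ok"}


def dispatch(handler, method, *args):
    """Hypothetical router: call handle_<verb>() if defined, else report 405."""
    fn = getattr(handler, "handle_" + method.lower(), None)
    if fn is None:
        return 405, None
    return 200, fn(*args)
```

For example, `dispatch(Handler(), "POST", {"x": 1})` calls handle_post(), while a DELETE request yields a 405 status because this handler does not define handle_delete().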
Bug fixes
- Fix invalid Async workload status during processing https://github.com/cortexlabs/cortex/pull/2106 https://github.com/cortexlabs/cortex/issues/2104 (RobertLucian)
Docs
- Add docs for configuring Grafana alerts (RobertLucian)
- Document how to create a Cortex cluster without administrator IAM access (vishalbollu)
- Add docs for mirroring Cortex's docker images to a private repo (vishalbollu)
Misc
- Support json output for the cortex cluster info command https://github.com/cortexlabs/cortex/pull/2089 https://github.com/cortexlabs/cortex/issues/2062 (RobertLucian)
- Allow nodegroups to be scaled down to max_instances == 0 https://github.com/cortexlabs/cortex/pull/2095 (deliahu)
Published by deliahu almost 5 years ago
cortex - v0.33.0
v0.33.0
New features
- Allow specifying a CIDR range whitelist for APIs and the operator (docs) https://github.com/cortexlabs/cortex/pull/2071 https://github.com/cortexlabs/cortex/issues/2003 (vishalbollu)
- Enable CORS for async, batch, and task APIs https://github.com/cortexlabs/cortex/pull/2082 https://github.com/cortexlabs/cortex/issues/2073 (deliahu)
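To make the CIDR whitelist feature above concrete, here is a short Python sketch of what matching a client address against a list of CIDR ranges means. It uses only the standard library's ipaddress module. The function name `is_allowed` and the sample ranges are hypothetical, and the sketch says nothing about where Cortex actually enforces the whitelist.

```python
import ipaddress


def is_allowed(client_ip, cidr_white_list):
    """Return True if client_ip falls inside at least one whitelisted CIDR range."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in cidr_white_list)


if __name__ == "__main__":
    white_list = ["10.0.0.0/16", "203.0.113.0/24"]  # sample ranges, not defaults
    print(is_allowed("10.0.1.5", white_list))
    print(is_allowed("192.168.1.1", white_list))
```

A whitelist containing 0.0.0.0/0 admits every IPv4 address, which is the usual way to express "no restriction".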
Breaking changes
- The onnx predictor type has been replaced by the python predictor type; please use the python predictor type instead (all onnx models are fully supported by the python predictor type)
Bug fixes
- Fix bug affecting async api consistency during heavy traffic https://github.com/cortexlabs/cortex/pull/2072 (RobertLucian)
- Fix bug affecting async api updates https://github.com/cortexlabs/cortex/pull/2067 (vishalbollu)
Misc
- Rename cortex cluster configure command to cortex cluster scale https://github.com/cortexlabs/cortex/pull/2040 https://github.com/cortexlabs/cortex/issues/1972 (RobertLucian)
- Disable AZRebalance autoscaling group process https://github.com/cortexlabs/cortex/pull/2042 https://github.com/cortexlabs/cortex/issues/1349 (RobertLucian)
- Add horizontal pod autoscaler to async API gateway https://github.com/cortexlabs/cortex/pull/2079 https://github.com/cortexlabs/cortex/issues/2078 (RobertLucian)
- Rename async modules to async_api to avoid name collision with the reserved keyword in Python 3.7+ https://github.com/cortexlabs/cortex/pull/2066 https://github.com/cortexlabs/cortex/issues/2052 (vishalbollu)
- Backup images to dockerhub https://github.com/cortexlabs/cortex/pull/2081 (vishalbollu)
- Add additional debugging info for cluster up failures https://github.com/cortexlabs/cortex/pull/2080 https://github.com/cortexlabs/cortex/issues/2027 (vishalbollu)
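The async_api rename in the list above follows from `async` becoming a reserved keyword in Python 3.7, which makes a statement such as `import async` a syntax error. A quick way to check whether a name is safe to use as a module name is sketched below; the helper name `is_importable_module_name` is our own.

```python
import keyword


def is_importable_module_name(name):
    """True if `name` is a valid identifier and not a reserved keyword."""
    return name.isidentifier() and not keyword.iskeyword(name)


if __name__ == "__main__":
    for candidate in ("async", "async_api"):
        print(candidate, is_importable_module_name(candidate))
```

On Python 3.7 or later, `async` fails this check while `async_api` passes.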
Published by deliahu almost 5 years ago
cortex - v0.32.0
v0.32.0
New features
- Add gRPC support to realtime APIs (docs) https://github.com/cortexlabs/cortex/pull/1997 https://github.com/cortexlabs/cortex/issues/1056 (RobertLucian)
- Add support for ONNX and TensorFlow predictor types in async APIs (docs) https://github.com/cortexlabs/cortex/pull/1996 https://github.com/cortexlabs/cortex/issues/1980 (miguelvr)
- Support using ECR images from other AWS accounts and regions https://github.com/cortexlabs/cortex/pull/2011 https://github.com/cortexlabs/cortex/issues/1988 (vishalbollu)
Breaking changes
- GCP support has been removed so that we can focus our efforts on improving the scalability, reliability, and security for Cortex on AWS. Cortex on GCP will still be available in v0.31. If you are currently using Cortex on GCP, our team will be happy to help you migrate to AWS or work with you to find alternative solutions. Please feel free to reach out to us on slack or email us at hello@cortex.dev if you're interested.
Bug fixes
- Fix memory plots on Grafana dashboards for realtime and batch APIs https://github.com/cortexlabs/cortex/pull/2024 https://github.com/cortexlabs/cortex/pull/2014 https://github.com/cortexlabs/cortex/issues/1970 (RobertLucian)
Docs
- Misc docs improvements https://github.com/cortexlabs/cortex/pull/1994 (ospillinger)
Misc
- Increase kubelet's registryPullQPS limit from 5 to 10 https://github.com/cortexlabs/cortex/pull/2023 https://github.com/cortexlabs/cortex/issues/1989 (miguelvr)
- Pin the AMI version https://github.com/cortexlabs/cortex/pull/2010 https://github.com/cortexlabs/cortex/issues/1975 https://github.com/cortexlabs/cortex/issues/1615 (vishalbollu)
Published by deliahu almost 5 years ago
cortex - v0.31.1
v0.31.1
Bug fixes
- Preemptible node pools on GCP aren't autoscaling https://github.com/cortexlabs/cortex/pull/1981 (vishalbollu)
- Replica autoscaler targets incorrect deployments on operator restart https://github.com/cortexlabs/cortex/pull/1982 (miguelvr)
- Replica autoscaler is not reinitialized for running APIs on operator restart on GCP https://github.com/cortexlabs/cortex/pull/1984 (vishalbollu)
Published by vishalbollu almost 5 years ago
cortex - v0.31.0
v0.31.0
New features
- Add support for AsyncAPI (experimental) (docs) https://github.com/cortexlabs/cortex/pull/1935 https://github.com/cortexlabs/cortex/issues/1610 (miguelvr)
- Add support for multi-instance-type clusters to AWS/GCP providers (experimental) (aws/gcp docs) https://github.com/cortexlabs/cortex/pull/1951 (RobertLucian)
- Allow users to duplicate/mirror traffic using shadow pipelines https://github.com/cortexlabs/cortex/pull/1948 https://github.com/cortexlabs/cortex/issues/1889 (docs) (vishalbollu)
Breaking changes
- on_demand_backup in cluster configuration has been removed in favour of using a cluster with a mixture of spot and on-demand nodegroups. See multi-instance documentation for aws and gcp for more details.
Bug fixes
- Fix Python client not respecting CORTEX_CLI_CONFIG_DIR environment variable for client-id.txt https://github.com/cortexlabs/cortex/pull/1953 (jackmpcollins)
- Prevent threads from being stuck in DynamicBatcher https://github.com/cortexlabs/cortex/pull/1915 (cbensimon)
- Fix unexpected cortex logs termination by increasing buffer size https://github.com/cortexlabs/cortex/pull/1939 (vishalbollu)
- Decouple cluster deletion from EBS volume deletion for cortex cluster down https://github.com/cortexlabs/cortex/pull/1954 (deliahu)
- Fix spot/on-demand GPU instances not joining the cluster by upgrading to eksctl 0.40.0 https://github.com/cortexlabs/cortex/pull/1955 (vishalbollu)
- Prevent premature "queue not found" errors by preserving the SQS queue for minutes after the job has completed https://github.com/cortexlabs/cortex/pull/1952 (vishalbollu)
Docs
- Update docs https://github.com/cortexlabs/cortex/pull/1949 (ospillinger)
Misc
- Configure a default cortex client to manage APIs from within cortex workloads https://github.com/cortexlabs/cortex/pull/1942 https://github.com/cortexlabs/cortex/issues/1644 (RobertLucian)
- Save batch metrics to cloud to preserve job metrics history https://github.com/cortexlabs/cortex/pull/1940 (vishalbollu)
Published by vishalbollu almost 5 years ago
cortex - v0.30.0
v0.30.0
New features
- Record custom metrics from predictors and view them in Grafana (docs) https://github.com/cortexlabs/cortex/pull/1910 https://github.com/cortexlabs/cortex/issues/1897 (miguelvr)
- Add granular pod metrics to the Grafana dashboards https://github.com/cortexlabs/cortex/pull/1905 (RobertLucian)
- Add node metrics to Grafana dashboards https://github.com/cortexlabs/cortex/pull/1900 (miguelvr)
Breaking changes
- Remove support for installing Cortex on your own Kubernetes Cluster https://github.com/cortexlabs/cortex/pull/1921 (RobertLucian)
Bug fixes
- Fix bug where successfully completed jobs were marked as completed with errors https://github.com/cortexlabs/cortex/pull/1913 (vishalbollu)
- Fix bug where batch jobs were being terminated unnecessarily https://github.com/cortexlabs/cortex/pull/1917 (vishalbollu)
- Prevent cluster autoscaler from reallocating job pods https://github.com/cortexlabs/cortex/pull/1919 (vishalbollu)
- Address AWS cluster up quota issues such as not enough NAT Gateways or EIPs https://github.com/cortexlabs/cortex/pull/1912 (RobertLucian)
- Delete unused prometheus volume on cluster down https://github.com/cortexlabs/cortex/pull/1863 (miguelvr)
- Create .cortex dir if not present https://github.com/cortexlabs/cortex/pull/1909 (RobertLucian)
Docs
- Add docs for accessing dashboard through private load balancer (docs) https://github.com/cortexlabs/cortex/pull/1907 (deliahu)
Misc
- Allow specifying paths for requirements.txt, conda-packages.txt & dependencies.sh (docs) https://github.com/cortexlabs/cortex/pull/1896 https://github.com/cortexlabs/cortex/pull/1927 https://github.com/cortexlabs/cortex/issues/1777 (miguelvr)
- Log relevant kubernetes events to API specific log streams https://github.com/cortexlabs/cortex/pull/1906 https://github.com/cortexlabs/cortex/issues/833 (miguelvr)
- Support credentials using AWS_SESSION_TOKEN with the CLI/Client (docs) https://github.com/cortexlabs/cortex/pull/1908 https://github.com/cortexlabs/cortex/pull/1920 https://github.com/cortexlabs/cortex/issues/1134 https://github.com/cortexlabs/cortex/issues/1865 (vishalbollu)
- Provide auth to Operator and APIs by attaching IAM policies to the cluster (docs) https://github.com/cortexlabs/cortex/pull/1908 https://github.com/cortexlabs/cortex/issues/1858 (vishalbollu)
Published by vishalbollu almost 5 years ago
cortex - v0.29.0
v0.29.0
New features
- Add Grafana dashboard for APIs (docs) https://github.com/cortexlabs/cortex/pull/1867 https://github.com/cortexlabs/cortex/pull/1885 https://github.com/cortexlabs/cortex/pull/1890 https://github.com/cortexlabs/cortex/pull/1887 (miguelvr)
- Support API autoscaling in GCP clusters (docs) https://github.com/cortexlabs/cortex/pull/1814 https://github.com/cortexlabs/cortex/pull/1879 https://github.com/cortexlabs/cortex/issues/1601 (miguelvr)
- Support traffic splitting in GCP clusters (docs) https://github.com/cortexlabs/cortex/pull/1892 https://github.com/cortexlabs/cortex/issues/1660 (miguelvr)
Breaking changes
- The default Docker images for APIs have been slimmed down to not include packages other than what Cortex requires to function. Therefore, when deploying APIs, it is now necessary to include the dependencies that your predictor needs in requirements.txt (docs) and/or dependencies.sh (docs).
Bug fixes
- Disable dynamic batcher for TensorFlow predictor type https://github.com/cortexlabs/cortex/pull/1888 (miguelvr)
- Support empty directory objects for models saved in S3/GCS https://github.com/cortexlabs/cortex/pull/1830 https://github.com/cortexlabs/cortex/issues/1829 (RobertLucian)
- Fix bug which prevented Task APIs on GCP from being cleaned up after completion https://github.com/cortexlabs/cortex/pull/1871 (RobertLucian)
Docs
- Add documentation for using a version of Python other than the default via dependencies.sh (docs) or custom images (docs) https://github.com/cortexlabs/cortex/pull/1862 https://github.com/cortexlabs/cortex/issues/1779 (RobertLucian)
Misc
- Support deploying predictor Python classes from more environments (e.g. from separate Python files, AWS Lambda) https://github.com/cortexlabs/cortex/pull/1883 https://github.com/cortexlabs/cortex/commit/3a1b777d06e660a49b6223badda4c5e8b1fe4ec1 https://github.com/cortexlabs/cortex/issues/1824 https://github.com/cortexlabs/cortex/issues/1826 (vishalbollu)
- Improve error logging for Batch and Task APIs https://github.com/cortexlabs/cortex/pull/1866 https://github.com/cortexlabs/cortex/issues/1833 (RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.28.0
v0.28.0
New features
- Support installing Cortex on an existing Kubernetes cluster (on AWS or GCP) (docs) https://github.com/cortexlabs/cortex/pull/1837 https://github.com/cortexlabs/cortex/issues/1808 (vishalbollu)
Breaking changes
- The cloudwatch dashboard has been removed as a result of our switch to Prometheus for metrics aggregation. The dashboard will be replaced with an alternative in an upcoming release.
Bug fixes
- Fix bug which can cause requests to APIs from a Python client to timeout during cluster autoscaling https://github.com/cortexlabs/cortex/pull/1841 https://github.com/cortexlabs/cortex/issues/1840 (RobertLucian)
- Fix bug which can cause downscale_stabilization_period to be disregarded during downscaling https://github.com/cortexlabs/cortex/pull/1847 https://github.com/cortexlabs/cortex/issues/1846 (RobertLucian)
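For readers unfamiliar with downscale stabilization, the sketch below shows one common interpretation, assumed here rather than taken from these notes: the autoscaler looks back over the stabilization period and refuses to scale below the highest replica recommendation made within it (the Kubernetes HPA scale-down stabilization window behaves this way). The function name, the history format, and the sample numbers are all hypothetical.

```python
def stabilized_replicas(history, now, period_seconds):
    """Pick the highest recommendation made within the look-back window.

    history: list of (timestamp_seconds, recommended_replicas) tuples.
    Returns None if no recommendation falls inside the window.
    """
    recent = [replicas for ts, replicas in history if now - ts <= period_seconds]
    return max(recent) if recent else None


if __name__ == "__main__":
    history = [(0, 5), (100, 3), (290, 2)]
    # At t=300 with a 300-second period, the recommendation of 5 is still in the window.
    print(stabilized_replicas(history, now=300, period_seconds=300))
    # At t=350 it has aged out, so the highest remaining recommendation applies.
    print(stabilized_replicas(history, now=350, period_seconds=300))
```

If the period were disregarded, as in the bug described above, the replica count would follow the latest recommendation immediately instead of holding at the recent maximum.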
Misc
- AWS credentials are no longer required to connect the CLI to the cluster operator. If you need to restrict access to your cluster operator, configure the operator's load balancer to be private by setting operator_load_balancer_scheme: internal in your cluster configuration file, and set up VPC Peering. We plan on supporting a new auth strategy in an upcoming release.
- Improve S6 error code/signal handling https://github.com/cortexlabs/cortex/pull/1825 https://github.com/cortexlabs/cortex/issues/1703 (RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.27.0
v0.27.0
New features
- Add new API type TaskAPI for running arbitrary Python jobs (docs) https://github.com/cortexlabs/cortex/pull/1717 https://github.com/cortexlabs/cortex/issues/253 (miguelvr, RobertLucian)
- Write Cortex's logs as structured logs, and allow use of Cortex's structured logger in predictors (supports adding extra fields) (aws docs, gcp docs) https://github.com/cortexlabs/cortex/pull/1778 https://github.com/cortexlabs/cortex/pull/1803 https://github.com/cortexlabs/cortex/pull/1804 https://github.com/cortexlabs/cortex/issues/1732 https://github.com/cortexlabs/cortex/issues/1563 (vishalbollu)
- Support preemptible instances on GCP (docs) https://github.com/cortexlabs/cortex/pull/1791 https://github.com/cortexlabs/cortex/issues/1631 (RobertLucian)
- Support private load balancers on GCP (docs) https://github.com/cortexlabs/cortex/pull/1786 https://github.com/cortexlabs/cortex/issues/1621 (deliahu)
- Support GCP instances with multiple GPUs (docs) https://github.com/cortexlabs/cortex/pull/1789 https://github.com/cortexlabs/cortex/issues/1784 (deliahu)
Breaking changes
- `cortex logs` now streams logs from a single replica at random when there are multiple replicas for an API. The recommended way to analyze production logs is via a dedicated logging tool (by default, logs are sent to CloudWatch on AWS and Stackdriver on GCP).
Bug fixes
- Misc Python client fixes https://github.com/cortexlabs/cortex/pull/1798 https://github.com/cortexlabs/cortex/pull/1782 https://github.com/cortexlabs/cortex/pull/1772 (vishalbollu, RobertLucian)
Docs
- Document the shared `/mnt` directory for TensorFlow predictors https://github.com/cortexlabs/cortex/pull/1802 https://github.com/cortexlabs/cortex/issues/1792 (deliahu)
- Misc GCP docs improvements https://github.com/cortexlabs/cortex/pull/1799 (deliahu)
Misc
- Improve out-of-memory status reporting (RobertLucian)
- Improve batch job cleanup process https://github.com/cortexlabs/cortex/pull/1797 https://github.com/cortexlabs/cortex/pull/1796 (vishalbollu)
- Remove gRPC message send/receive limit https://github.com/cortexlabs/cortex/pull/1769 https://github.com/cortexlabs/cortex/issues/1740 (RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.26.0
v0.26.0
New features
- Support configuring the log level for APIs (docs) https://github.com/cortexlabs/cortex/pull/1741 https://github.com/cortexlabs/cortex/issues/1484 (RobertLucian)
- Support creating a cluster in an existing AWS VPC (docs) https://github.com/cortexlabs/cortex/pull/1759 https://github.com/cortexlabs/cortex/issues/1142 (deliahu)
- Support specifying the GCP network and subnet for the Cortex cluster (docs) https://github.com/cortexlabs/cortex/pull/1752 https://github.com/cortexlabs/cortex/issues/1738 (deliahu)
- Support configuring shared memory size (shm) for inter-process communication (docs) https://github.com/cortexlabs/cortex/pull/1756 https://github.com/cortexlabs/cortex/issues/1638 (vishalbollu)
Breaking changes
- The local provider has been removed. The best way to test your predictor implementation locally is to import it in a separate Python file and call your `__init__()` and `predict()` functions directly. The best way to test your API is to deploy it to a dev/test cluster.
- Built-in support for API Gateway has been removed. If you need to create an HTTPS endpoint with valid certs, some options are to set up a custom domain or to manually create an API Gateway.
- Prediction monitoring has been removed. We are exploring how to build a more powerful and customizable solution for this.
- The `predict` CLI command has been deleted. `curl`, `requests`, etc. are the best tools for testing APIs.
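A minimal sketch of the local testing approach recommended above, now that the local provider is gone. The `PythonPredictor` class, its `config` keys, and the payload shape are hypothetical stand-ins; in a real project the class would be imported from your own predictor file rather than defined inline.

```python
# Hypothetical predictor, defined inline so the sketch is self-contained.
# In a real project this class would live in its own file and be imported.
class PythonPredictor:
    def __init__(self, config):
        # Load whatever the predictor needs; here, just a threshold.
        self.threshold = config.get("threshold", 0.5)

    def predict(self, payload):
        # Classify a single score against the configured threshold.
        return {"positive": payload["score"] >= self.threshold}


def run_local_check():
    # Call __init__() and predict() directly, with no cluster involved.
    predictor = PythonPredictor(config={"threshold": 0.7})
    return predictor.predict({"score": 0.9})


if __name__ == "__main__":
    print(run_local_check())
```

Because nothing here depends on a running cluster, the same calls can be wrapped in ordinary unit tests.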
Bug fixes
- For multi-model APIs, allow model names to share a prefix https://github.com/cortexlabs/cortex/pull/1745 https://github.com/cortexlabs/cortex/issues/1699 (RobertLucian)
Docs
- Misc docs improvements (ospillinger)
Published by deliahu about 5 years ago
cortex - v0.25.0
v0.25.0
New features
- Support server-side micro batching for the Python predictor (docs) https://github.com/cortexlabs/cortex/pull/1653 https://github.com/cortexlabs/cortex/issues/1382 (miguelvr)
- Add timeout configuration for batch jobs (docs) https://github.com/cortexlabs/cortex/pull/1712 https://github.com/cortexlabs/cortex/issues/1324 (vishalbollu)
- Support batch retries (docs) https://github.com/cortexlabs/cortex/pull/1713 https://github.com/cortexlabs/cortex/issues/1540 (lapaniku, vishalbollu)
- Support sending failed batches to a dead-letter queue (docs) https://github.com/cortexlabs/cortex/pull/1713 https://github.com/cortexlabs/cortex/issues/1541 (lapaniku, vishalbollu)
- Support installing the cortex Python client in predictors https://github.com/cortexlabs/cortex/pull/1709 https://github.com/cortexlabs/cortex/issues/1670 https://github.com/cortexlabs/cortex/issues/1206 (RobertLucian)
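The batch retry and dead-letter queue features above can be pictured with a small in-memory sketch. This is not Cortex's implementation: the function name, the `max_retries` parameter, and the use of a plain list as the dead-letter queue are all assumptions made for illustration.

```python
def process_batches(batches, handler, max_retries, dead_letter_queue):
    """Run handler on each batch, retrying failures up to max_retries times.

    Batches that still fail after the final retry are appended to
    dead_letter_queue (a plain list in this sketch) instead of being dropped.
    """
    for batch in batches:
        for attempt in range(max_retries + 1):
            try:
                handler(batch)
                break  # success, move on to the next batch
            except Exception:
                if attempt == max_retries:
                    # Out of retries: park the batch for later inspection.
                    dead_letter_queue.append(batch)
```

Keeping failed batches in a separate queue lets the rest of the job finish while preserving the inputs that need investigation.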
Breaking changes
- The `predictor.model_path` field of the realtime API configuration has been moved to `predictor.models.path`. In addition, for the Python predictor type, `predictor.models` has been renamed to `predictor.multi_model_reloading`. See the docs for the entire API configuration schema.
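As a rough illustration of the two renames in this breaking change, the sketch below rewrites an API configuration held as a Python dictionary. The helper name is hypothetical, the `"python"` value for `predictor.type` is an assumption, and the renames are applied literally as stated in the note; consult the configuration schema for anything beyond that.

```python
import copy


def migrate_predictor_config(api_config):
    """Return a copy of api_config using the v0.25.0 field names."""
    migrated = copy.deepcopy(api_config)
    predictor = migrated.get("predictor", {})

    # For the Python predictor type, rename the old `models` field first so it
    # cannot collide with the new `models.path` location.
    if predictor.get("type") == "python" and "models" in predictor:
        predictor["multi_model_reloading"] = predictor.pop("models")

    # `model_path` moves under `models` as `path`.
    if "model_path" in predictor:
        predictor.setdefault("models", {})["path"] = predictor.pop("model_path")

    return migrated
```

The input dictionary is left untouched, so the old and new configurations can be compared before redeploying.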
Bug fixes
- Misc batch reliability improvements https://github.com/cortexlabs/cortex/pull/1705 https://github.com/cortexlabs/cortex/pull/1718 https://github.com/cortexlabs/cortex/pull/1729 (vishalbollu)
Docs
- Reorganize the docs structure https://github.com/cortexlabs/cortex/pull/1696 https://github.com/cortexlabs/cortex/pull/1701 https://github.com/cortexlabs/cortex/pull/1704 https://github.com/cortexlabs/cortex/pull/1719 https://github.com/cortexlabs/cortex/issues/1675 (ospillinger)
- Add GCP to the contributing guide https://github.com/cortexlabs/cortex/pull/1720 https://github.com/cortexlabs/cortex/issues/1654 (deliahu)
- Add docs for setting up kubectl on GCP https://github.com/cortexlabs/cortex/commit/759b4b144c25cc623e1b385b036f83825d122db7 (deliahu)
Misc
- Parse the request body as a string when content type `text/plain` is specified https://github.com/cortexlabs/cortex/pull/1714 (deliahu)
- Support paths to single ONNX files in API configuration https://github.com/cortexlabs/cortex/pull/1711 https://github.com/cortexlabs/cortex/issues/1686 (RobertLucian)
- Support deploying public S3 models on GCP, and public GCS models on AWS https://github.com/cortexlabs/cortex/pull/1694 https://github.com/cortexlabs/cortex/issues/1684 (RobertLucian)
- Pre-download docker images when creating GCP clusters https://github.com/cortexlabs/cortex/pull/1721 https://github.com/cortexlabs/cortex/issues/1658 (deliahu)
- Speed up the validation processes for multi-model APIs https://github.com/cortexlabs/cortex/pull/1690 https://github.com/cortexlabs/cortex/issues/1663 (RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.24.1
v0.24.1
Bug fixes
- Propagate the exit code from the predictor's initialization so that the API status is set to "error" when initialization fails https://github.com/cortexlabs/cortex/issues/1680 https://github.com/cortexlabs/cortex/pull/1691 (RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.24.0
v0.24.0
New features
- Add GCP support: our initial release supports all three predictor types (Python, TensorFlow, ONNX), on CPU or GPU, with live reloading, multi-model caching, and cluster autoscaling https://github.com/cortexlabs/cortex/pull/1655 https://github.com/cortexlabs/cortex/pull/1672 https://github.com/cortexlabs/cortex/pull/1667 https://github.com/cortexlabs/cortex/issues/1661 https://github.com/cortexlabs/cortex/issues/114 https://github.com/cortexlabs/cortex/issues/1600 https://github.com/cortexlabs/cortex/issues/1602 https://github.com/cortexlabs/cortex/issues/1616 https://github.com/cortexlabs/cortex/issues/1624 (RobertLucian, deliahu, vishalbollu)
- Add the patch command to the CLI and Python client, which can be used to update an API using only the API configuration (without needing to provide the predictor's Python implementation) https://github.com/cortexlabs/cortex/pull/1651 https://github.com/cortexlabs/cortex/pull/1666 https://github.com/cortexlabs/cortex/issues/1329 (vishalbollu)
- Support deploying predictor Python classes from the Python client https://github.com/cortexlabs/cortex/pull/1587 https://github.com/cortexlabs/cortex/issues/1617 (see the tutorial for an example) (vishalbollu)
Breaking changes
- The Python client's `deploy()` function has been renamed to `create_api()`, and some of the argument names have changed (docs)
Bug fixes
- Enable CORS for APIs accessed via API Gateway or load balancer https://github.com/cortexlabs/cortex/pull/1649 https://github.com/cortexlabs/cortex/issues/1234 (RobertLucian, deliahu)
- Fix local TensorFlow models when live reloading is enabled https://github.com/cortexlabs/cortex/pull/1668 https://github.com/cortexlabs/cortex/issues/1554 (RobertLucian)
- Prevent TensorFlow multi-model caching from attempting to download local models from S3 https://github.com/cortexlabs/cortex/pull/1669 https://github.com/cortexlabs/cortex/issues/1598 (RobertLucian)
Docs
- Miscellaneous docs improvements (vishalbollu, ospillinger)
Misc
- Improve Python client cross Python version compatibility https://github.com/cortexlabs/cortex/pull/1640 (vishalbollu)
- Reinstall TensorFlow and ONNX dependencies when the Python version is overridden https://github.com/cortexlabs/cortex/pull/1652 (vishalbollu)
- Terminate container when bootloader script fails https://github.com/cortexlabs/cortex/pull/1639 (vishalbollu)
Published by deliahu about 5 years ago
cortex - v0.23.0
v0.23.0
New features
- Update Python client `deploy()` to accept a Python dictionary for API configuration (previously, only a file path was supported) (docs) https://github.com/cortexlabs/cortex/pull/1587 (vishalbollu)
- Show API deployment history in `cortex get API_NAME` command https://github.com/cortexlabs/cortex/pull/1544 https://github.com/cortexlabs/cortex/issues/1496 (deliahu)
- Add `cortex export API_NAME` and `cortex export API_NAME API_ID` commands to export specific and historical API deployments https://github.com/cortexlabs/cortex/pull/1544 https://github.com/cortexlabs/cortex/issues/1497 (deliahu)
- Build and push `python-predictor-gpu-slim` image with different combinations of CUDA and cuDNN (`cuda10.0-cudnn7`, `cuda10.1-cudnn7`, `cuda10.1-cudnn8`, `cuda10.2-cudnn7`, `cuda10.2-cudnn8`, `cuda11.0-cudnn8`, `cuda11.1-cudnn8`) (docs) https://github.com/cortexlabs/cortex/pull/1575 https://github.com/cortexlabs/cortex/issues/1574 (deliahu)
Bug fixes
- Allow local deployments of public S3 models without requiring AWS credentials https://github.com/cortexlabs/cortex/pull/1589 https://github.com/cortexlabs/cortex/issues/1588 (RobertLucian)
Docs
- Add guide for avoiding Docker Hub rate limits https://github.com/cortexlabs/cortex/pull/1576 (RobertLucian, deliahu)
- Add guide for self-hosting Cortex's Docker images https://github.com/cortexlabs/cortex/pull/1579 (RobertLucian, deliahu)
Misc
- Remove API request maximum payload size limit https://github.com/cortexlabs/cortex/pull/1583 (deliahu)
- Switch to Quay docker container registry https://github.com/cortexlabs/cortex/pull/1578 (deliahu, RobertLucian)
Published by deliahu about 5 years ago
cortex - v0.22.1
v0.22.1
Bug fixes
- Set the predictor's working directory to the root Cortex project directory https://github.com/cortexlabs/cortex/pull/1573 https://github.com/cortexlabs/cortex/issues/1572 (deliahu)
- Allow `max_instances` to be updated via `cortex cluster configure` https://github.com/cortexlabs/cortex/pull/1568 https://github.com/cortexlabs/cortex/issues/1567 (deliahu)
- Gracefully stop the serving container when a multi-processed cron throws an exception https://github.com/cortexlabs/cortex/pull/1560 https://github.com/cortexlabs/cortex/issues/1552 (RobertLucian)
Docs
- Demonstrate how to make API requests with various payload types (binary, form fields, etc), and show how to access them in `predict()` https://github.com/cortexlabs/cortex/pull/1566 (docs)
- Misc docs improvements https://github.com/cortexlabs/cortex/pull/1551 https://github.com/cortexlabs/cortex/pull/1556 c3dab4045a61703cb1db1d5f95776614252f96c0 https://github.com/cortexlabs/cortex/pull/1557 (deliahu, RobertLucian)
Misc
- Build and upload the Python package/CLI to a public S3 bucket https://github.com/cortexlabs/cortex/pull/1562 (vishalbollu)
Published by deliahu over 5 years ago
cortex - v0.22.0
v0.22.0
New features
- Multi-model caching: serve a collection of models that is collectively bigger than what will fit in memory (via LRU cache eviction) (docs) https://github.com/cortexlabs/cortex/pull/1428 https://github.com/cortexlabs/cortex/issues/619 (RobertLucian)
- Live reloading: support updating models in running APIs by adding new versions to the model's S3 directory (docs) https://github.com/cortexlabs/cortex/pull/1428 https://github.com/cortexlabs/cortex/issues/1252 (RobertLucian)
- Inter-process fairness: distribute requests within an API replica evenly across all processes https://github.com/cortexlabs/cortex/pull/1526 https://github.com/cortexlabs/cortex/issues/839 https://github.com/cortexlabs/cortex/issues/1298 (RobertLucian)
- Support requests between APIs within the same cluster (docs) https://github.com/cortexlabs/cortex/pull/1503 https://github.com/cortexlabs/cortex/issues/1241 (deliahu)
- Allow overriding of CLI install path and config directory (via `$CORTEX_INSTALL_PATH` and `$CORTEX_CLI_CONFIG_DIR`) (docs) https://github.com/cortexlabs/cortex/pull/1521 https://github.com/cortexlabs/cortex/issues/1222 (deliahu)
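The LRU eviction idea behind multi-model caching can be sketched in a few lines. This is an illustration of the general technique, not Cortex's code: the `ModelCache` class, its `capacity` parameter, and the `load_model` callable are hypothetical.

```python
from collections import OrderedDict


class ModelCache:
    """Keep at most `capacity` models in memory, evicting the least recently used."""

    def __init__(self, capacity, load_model):
        self.capacity = capacity
        self.load_model = load_model  # hypothetical loader: model name -> model object
        self._models = OrderedDict()

    def get(self, name):
        if name in self._models:
            # Cache hit: mark the model as most recently used.
            self._models.move_to_end(name)
            return self._models[name]
        # Cache miss: load the model, then evict if over capacity.
        model = self.load_model(name)
        self._models[name] = model
        if len(self._models) > self.capacity:
            # The front of the ordering is the least recently used entry.
            self._models.popitem(last=False)
        return model

    def cached_names(self):
        return list(self._models)
```

With a capacity smaller than the number of models, rarely requested models are reloaded on demand while frequently requested ones stay resident.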
Breaking changes
- ONNX model paths in API configuration files must now point to a directory containing a single ONNX file, rather than the ONNX file itself. For example, `model_path: s3://cortex-examples/onnx/yolov5-youtube/yolov5s.onnx` becomes `model_path: s3://cortex-examples/onnx/yolov5-youtube`.
- The `--env`/`-e` flag in all `cortex cluster` commands has been renamed to `--configure-env`/`-e`, and if not provided, the environment named `aws` will no longer be configured in the `cortex cluster info` command.
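For configurations that still point at an `.onnx` file, the path change above amounts to dropping the file name. A small sketch, using a hypothetical helper name:

```python
import posixpath


def onnx_model_dir(model_path):
    """Convert a path to an .onnx file into the path of its containing directory."""
    if model_path.endswith(".onnx"):
        # Drop the final path component (the file name).
        return posixpath.dirname(model_path)
    # Already a directory-style path; leave it unchanged.
    return model_path
```

Note that the directory must contain exactly one ONNX file for the new form to be valid, which this helper does not check.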
Bug fixes
- Fix intermittent failed requests during rolling updates https://github.com/cortexlabs/cortex/pull/1526 https://github.com/cortexlabs/cortex/issues/814 (RobertLucian)
- Prevent CLI environments from getting overwritten when multiple `cortex cluster` commands are run concurrently https://github.com/cortexlabs/cortex/pull/1520 https://github.com/cortexlabs/cortex/issues/1410 (deliahu)
Docs
- Add Python client docs https://github.com/cortexlabs/cortex/pull/1519 https://github.com/cortexlabs/cortex/issues/1502 (deliahu)
- Add guide for running in production https://github.com/cortexlabs/cortex/pull/1513 https://github.com/cortexlabs/cortex/issues/1464 https://github.com/cortexlabs/cortex/issues/1257 (deliahu)
- Add guide for low-cost clusters https://github.com/cortexlabs/cortex/pull/1514 https://github.com/cortexlabs/cortex/issues/1425 (deliahu)
- Add guide for using a REST API Gateway https://github.com/cortexlabs/cortex/pull/1505 https://github.com/cortexlabs/cortex/issues/1228 (deliahu)
- Add guide for troubleshooting `cortex cluster down` failures https://github.com/cortexlabs/cortex/pull/1515 https://github.com/cortexlabs/cortex/issues/1319 (deliahu)
Misc
- Stagger Predictor `__init__()` calls to reduce peak memory consumption https://github.com/cortexlabs/cortex/pull/1543 https://github.com/cortexlabs/cortex/issues/1450 (RobertLucian)
- Add `--name`/`-n` and `--region`/`-r` flags to `cortex cluster info`, `cortex cluster export`, and `cortex cluster down` commands https://github.com/cortexlabs/cortex/pull/1492 https://github.com/cortexlabs/cortex/issues/1363 (RobertLucian)
- Rename `--env`/`-e` flag to `--configure-env`/`-e` in `cortex cluster` commands and update its behavior https://github.com/cortexlabs/cortex/pull/1533 https://github.com/cortexlabs/cortex/issues/1412 (deliahu)
- Disallow ARM-based instances, which are not currently supported https://github.com/cortexlabs/cortex/pull/1536 (deliahu)
- Validate AWS vCPU quota is sufficient for up to `max_instances` instances when running `cortex cluster up` and `cortex cluster configure` https://github.com/cortexlabs/cortex/pull/1537 https://github.com/cortexlabs/cortex/issues/1461 (deliahu)
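The vCPU quota validation in the last item comes down to a simple comparison. The sketch below shows only that arithmetic; the function and parameter names are hypothetical, and the real check may account for details (such as other nodes in the cluster) that are not modeled here.

```python
def vcpu_quota_is_sufficient(max_instances, vcpus_per_instance, vcpu_quota):
    """Check that a fully scaled-up cluster would fit within the vCPU quota."""
    required_vcpus = max_instances * vcpus_per_instance
    return required_vcpus <= vcpu_quota
```

For example, 10 instances with 4 vCPUs each need 40 vCPUs, so a quota of 32 would fail the check while a quota of 64 would pass.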
Published by deliahu over 5 years ago
cortex - v0.21.0
New features
- Add Python client: pypi.org/project/cortex https://github.com/cortexlabs/cortex/pull/1449 https://github.com/cortexlabs/cortex/issues/684 (vishalbollu)
- Add support for private docker image registries (docs) https://github.com/cortexlabs/cortex/pull/1460 https://github.com/cortexlabs/cortex/issues/1113 (deliahu)
Bug fixes
- Fix minor BatchAPI bugs https://github.com/cortexlabs/cortex/pull/1471 https://github.com/cortexlabs/cortex/pull/1468 https://github.com/cortexlabs/cortex/pull/1480 https://github.com/cortexlabs/cortex/issues/1473 (vishalbollu, RobertLucian)
- Bypass instance limit check if AWS's API doesn't provide quota information (this was blocking cluster creation in `eu-north-1`) https://github.com/cortexlabs/cortex/pull/1439 https://github.com/cortexlabs/cortex/issues/1438 (deliahu)
Docs
- Add a guide for how to install the CLI on Windows https://github.com/cortexlabs/cortex/pull/1476 https://github.com/cortexlabs/cortex/issues/715 (RobertLucian)
Misc
- Change default local port from 8888 to 8890 to avoid port conflicts with Jupyter https://github.com/cortexlabs/cortex/pull/1456 (vishalbollu)
- Disallow instance types that aren't supported by NLB https://github.com/cortexlabs/cortex/pull/1436 https://github.com/cortexlabs/cortex/issues/1433 (deliahu)
- Add `--cluster-aws-key` and `--cluster-aws-secret` flags to `cortex cluster configure` command https://github.com/cortexlabs/cortex/pull/1404 (deliahu)
- Add `--output` flag to `cortex env list` command https://github.com/cortexlabs/cortex/pull/1444 (vishalbollu)
Published by vishalbollu over 5 years ago
cortex - v0.20.0
v0.20.0
New features
- Add `cortex cluster export` command to export all APIs running in a cluster (docs) https://github.com/cortexlabs/cortex/pull/1368 https://github.com/cortexlabs/cortex/issues/1255 (vishalbollu)
- Enable users to specify CIDR ranges for the cluster's VPC (docs) https://github.com/cortexlabs/cortex/pull/1388 (vishalbollu)
- Support JSON output for CLI commands (via `-o`/`--output json`) https://github.com/cortexlabs/cortex/pull/1365 https://github.com/cortexlabs/cortex/issues/1161 (vishalbollu)
- Support the NVIDIA device driver (nvidia-container-toolkit) when running locally https://github.com/cortexlabs/cortex/pull/1366 https://github.com/cortexlabs/cortex/issues/1223 (vishalbollu)
Breaking changes
- The valid values for `api_gateway` in the cluster configuration file have been changed from `enabled`/`disabled` to `public`/`none` (to match the values for `networking.api_gateway` in the API configuration file).
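A small sketch of the value change above, applied to a cluster configuration held as a Python dictionary. The helper name is hypothetical, and the pairing of old to new values (`enabled` to `public`, `disabled` to `none`) is inferred from the order in which the note lists them.

```python
# Old cluster-config value -> new value, following the order given in the note.
API_GATEWAY_VALUE_MAP = {"enabled": "public", "disabled": "none"}


def migrate_api_gateway(cluster_config):
    """Return a copy of the cluster config with `api_gateway` using the new values."""
    migrated = dict(cluster_config)
    old_value = migrated.get("api_gateway")
    if old_value in API_GATEWAY_VALUE_MAP:
        migrated["api_gateway"] = API_GATEWAY_VALUE_MAP[old_value]
    return migrated
```

Values that are already `public` or `none`, and configurations without the field, pass through unchanged.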
Bug fixes
- Support AWS tags with spaces and valid special characters https://github.com/cortexlabs/cortex/pull/1374 https://github.com/cortexlabs/cortex/pull/1355 https://github.com/cortexlabs/cortex/pull/1380 https://github.com/cortexlabs/cortex/pull/1385 https://github.com/cortexlabs/cortex/issues/1373 (deliahu)
- Fix tensor shape validation for the TensorFlow predictor https://github.com/cortexlabs/cortex/pull/1311 https://github.com/cortexlabs/cortex/issues/1310 (RobertLucian)
- Allow `cortex cluster *` commands to be run from within a Docker container https://github.com/cortexlabs/cortex/pull/1370 https://github.com/cortexlabs/cortex/issues/1361 https://github.com/cortexlabs/cortex/issues/1325 (deliahu)
New examples
- pytorch/question-generator to generate questions given text and the correct answer (uses transformers and spacy) https://github.com/cortexlabs/cortex/pull/1308 (ismaelc)
Docs
- Add documentation for how to install a specific version of the CLI https://github.com/cortexlabs/cortex/pull/1386 https://github.com/cortexlabs/cortex/issues/1244 (vishalbollu)
- Add sections for overprovisioning and responsiveness to autoscaling docs https://github.com/cortexlabs/cortex/pull/1397 (deliahu)
- Add documentation for how to allow IAM users who did not create the cortex cluster to run `cortex cluster *` commands https://github.com/cortexlabs/cortex/pull/1392 https://github.com/cortexlabs/cortex/issues/1391 (deliahu)
- Add guide for setting up `kubectl` to access the cluster https://github.com/cortexlabs/cortex/pull/1344 https://github.com/cortexlabs/cortex/issues/1343 (RobertLucian)
Misc
- Update sources of AWS credentials for `cortex cluster *` commands, and improve transparency (docs) https://github.com/cortexlabs/cortex/pull/1378 https://github.com/cortexlabs/cortex/issues/1229 (vishalbollu)
- Rename cluster `api_gateway` config values to match API config https://github.com/cortexlabs/cortex/pull/1335 https://github.com/cortexlabs/cortex/issues/1334 (deliahu)
- Set the default value for `networking.api_gateway` in the API configuration to `none` if API Gateway is disabled cluster-wide https://github.com/cortexlabs/cortex/pull/1337 https://github.com/cortexlabs/cortex/issues/1336 (deliahu)
- Support c6g and r6g instances https://github.com/cortexlabs/cortex/pull/1332 https://github.com/cortexlabs/cortex/issues/809 (deliahu)
- Display autoscaling group activity history when `cortex cluster up` fails https://github.com/cortexlabs/cortex/pull/1342 https://github.com/cortexlabs/cortex/issues/1340 (deliahu)
- Print debug info if `cortex cluster up` times out https://github.com/cortexlabs/cortex/pull/1396 (deliahu)
- Add Inferentia compute statistics to `cortex cluster info` command https://github.com/cortexlabs/cortex/pull/1354 https://github.com/cortexlabs/cortex/issues/1304 (RobertLucian)
- Disable prompts in `get-cli.sh` if not running interactively https://github.com/cortexlabs/cortex/pull/1372 https://github.com/cortexlabs/cortex/issues/1371 (deliahu)
- Update `cortex help` output https://github.com/cortexlabs/cortex/pull/1398 (deliahu)
Published by RobertLucian over 5 years ago
cortex - v0.19.0
New features
- Support batch APIs (docs) https://github.com/cortexlabs/cortex/pull/1203 https://github.com/cortexlabs/cortex/issues/523 (vishalbollu)
- Support traffic splitting (enables A/B testing, multi-armed bandit, etc) (docs) https://github.com/cortexlabs/cortex/pull/1213 https://github.com/cortexlabs/cortex/pull/1270 https://github.com/cortexlabs/cortex/issues/1132 https://github.com/cortexlabs/cortex/issues/275 https://github.com/cortexlabs/cortex/issues/1089 (tthebst)
- Support server-side request batching for the TensorFlow Predictor (docs) https://github.com/cortexlabs/cortex/pull/1193 https://github.com/cortexlabs/cortex/issues/1060 (RobertLucian)
- Add `post_predict()` method to Predictor interface (runs after the response has been sent) (docs) https://github.com/cortexlabs/cortex/pull/1237 https://github.com/cortexlabs/cortex/issues/954 (RobertLucian)
- Support disabling API Gateway cluster-wide (docs) https://github.com/cortexlabs/cortex/pull/1259 https://github.com/cortexlabs/cortex/issues/1198 (deliahu)
- Support different CUDA versions for the slim Python Predictor image (docs) https://github.com/cortexlabs/cortex/pull/1263 https://github.com/cortexlabs/cortex/issues/923 https://github.com/cortexlabs/cortex/issues/1254 (RobertLucian)
- Add additional widgets to the CloudWatch Dashboard (avg in-flight requests per replica, active replicas) (docs) https://github.com/cortexlabs/cortex/pull/1181 (RobertLucian)
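The idea behind traffic splitting (for example, an 80/20 A/B test) can be illustrated with weighted random selection. This is only a conceptual sketch: Cortex performs the split in its networking layer rather than in user code, and the function name, API names, and weights here are hypothetical.

```python
import random


def pick_api(weights, rng=random):
    """Pick an API name with probability proportional to its weight.

    `weights` maps API names to non-negative weights, e.g. {"model-a": 80, "model-b": 20}.
    `rng` may be the random module or a random.Random instance (useful for seeding).
    """
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]
```

Over many requests, each API receives a share of traffic close to its weight divided by the total weight.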
Breaking changes
- `kind` is now a required top-level field for all API configurations. Existing APIs should add `kind: RealtimeAPI`. This release adds support for `kind: BatchAPI` and `kind: TrafficSplitter`.
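For existing configurations, the change above is a one-field addition. A minimal sketch for a configuration held as a Python dictionary, using a hypothetical helper name:

```python
def add_default_kind(api_config):
    """Return a copy of an API config with `kind` set if it was missing."""
    migrated = dict(api_config)
    # Configurations written before this release describe realtime APIs.
    migrated.setdefault("kind", "RealtimeAPI")
    return migrated
```

Configurations that already declare a `kind` (such as `BatchAPI`) are returned unchanged.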
Bug fixes
- Fix `python_path` config field https://github.com/cortexlabs/cortex/pull/1202 (deliahu)
- Fix local TensorFlow deploy from parent directory https://github.com/cortexlabs/cortex/pull/1274 (deliahu)
- Improve error response for invalid payloads https://github.com/cortexlabs/cortex/pull/1212 https://github.com/cortexlabs/cortex/issues/1208 (RobertLucian)
New examples
- onnx/yolov5-youtube https://github.com/cortexlabs/cortex/pull/1201 (dsuess)
- Update PyTorch text generator example to use Hugging Face transformers GPT-2 model https://github.com/cortexlabs/cortex/pull/1177 (ospillinger)
Docs
- Update tutorial to use the pytorch text-generator example https://github.com/cortexlabs/cortex/pull/1278 https://github.com/cortexlabs/cortex/issues/1256 (deliahu)
- Improve instructions for updating cluster without downtime https://github.com/cortexlabs/cortex/pull/1261 (deliahu)
- Mention API Gateway timeout in 404/503 API responses guide https://github.com/cortexlabs/cortex/pull/1264 https://github.com/cortexlabs/cortex/issues/1225 (deliahu)
Misc
- Set tags on log groups https://github.com/cortexlabs/cortex/pull/1164 https://github.com/cortexlabs/cortex/issues/1078 (tthebst)
- Display API metrics in the CLI by API ID (rather than by API name) https://github.com/cortexlabs/cortex/pull/1216 (vishalbollu)
- Fix recursive error message for deploy/delete CLI commands https://github.com/cortexlabs/cortex/pull/1247 https://github.com/cortexlabs/cortex/issues/1218 (RobertLucian)
- Add shell completion to .zshrc file during CLI installation https://github.com/cortexlabs/cortex/pull/1265 https://github.com/cortexlabs/cortex/issues/1221 (deliahu)
- Handle OOM error when project files are too large https://github.com/cortexlabs/cortex/pull/1217 (RobertLucian)
- Display image pull errors https://github.com/cortexlabs/cortex/pull/1167 https://github.com/cortexlabs/cortex/issues/955 (deliahu)
- Display local Docker image pull error when out of space https://github.com/cortexlabs/cortex/pull/1238 https://github.com/cortexlabs/cortex/issues/1236 (zouyee)
Published by vishalbollu over 5 years ago
cortex - v0.18.1
Bug fixes
- Fix dynamic axes for ONNX models https://github.com/cortexlabs/cortex/pull/1187 https://github.com/cortexlabs/cortex/issues/1186 (RobertLucian)
- Fix memory node capacity calculation for multi-api configuration files https://github.com/cortexlabs/cortex/pull/1185 (deliahu)
- Check cluster-name tag when choosing load balancer for VPC Link integration https://github.com/cortexlabs/cortex/pull/1173 (deliahu)
New guides
- Troubleshooting: API request errors (deliahu)
- Troubleshooting: TensorFlow session in predict() (RobertLucian)
Misc
- Delete API Gateway if `cluster up` fails https://github.com/cortexlabs/cortex/pull/1172 (deliahu)
- Move image version verification from serve.py to run.sh https://github.com/cortexlabs/cortex/pull/1180 https://github.com/cortexlabs/cortex/pull/1183 (vishalbollu)
- Add retries for resource tagging during `cluster up` https://github.com/cortexlabs/cortex/pull/1188 (deliahu)
- Use info log level when TensorFlow model is being loaded https://github.com/cortexlabs/cortex/pull/1171 (RobertLucian)
- Increase max number of processes per API replica to 100 https://github.com/cortexlabs/cortex/pull/1166 (RobertLucian)
- Allow empty cluster config https://github.com/cortexlabs/cortex/pull/1179 (deliahu)
Published by deliahu over 5 years ago
cortex - v0.18.0
New features
- Support Inferentia instances https://github.com/cortexlabs/cortex/pull/1119 https://github.com/cortexlabs/cortex/issues/654 (RobertLucian)
- Automatically provision HTTPS API Gateway endpoints for Cortex APIs https://github.com/cortexlabs/cortex/pull/1108 https://github.com/cortexlabs/cortex/issues/1077 (tthebst)
- Support multi-model endpoints for TensorFlow and ONNX predictors https://github.com/cortexlabs/cortex/pull/1107 https://github.com/cortexlabs/cortex/issues/890 (RobertLucian)
- Support local Docker images in the local environment https://github.com/cortexlabs/cortex/pull/1114 https://github.com/cortexlabs/cortex/issues/1094 (RobertLucian)
- Support replica parallelism fields (`processes_per_replica` and `threads_per_process`) in the local environment https://github.com/cortexlabs/cortex/pull/1158 https://github.com/cortexlabs/cortex/issues/960 https://github.com/cortexlabs/cortex/issues/1090 (RobertLucian)
- Support a `.env` file to export environment variables in the API container https://github.com/cortexlabs/cortex/pull/1154 https://github.com/cortexlabs/cortex/issues/1147 (RobertLucian, spentaur)
Breaking changes
- `autoscaling.workers_per_replica` and `autoscaling.threads_per_worker` have been moved/renamed to `predictor.processes_per_replica` and `predictor.threads_per_process` (see API configuration docs)
- `endpoint` and `local_port` have been moved to a new sub-field called `networking` (see API configuration docs)
- `model` has been renamed to `model_path` in TensorFlow and ONNX predictors (see API configuration docs)
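The three field moves above can be sketched as a rewrite of an API configuration held as a Python dictionary. The helper name is hypothetical, and two details are assumptions rather than statements from the notes: that `endpoint` and `local_port` were previously top-level fields, and that `model` lives under `predictor`.

```python
import copy


def migrate_v018(api_config):
    """Return a copy of api_config using the v0.18.0 field names."""
    migrated = copy.deepcopy(api_config)
    autoscaling = migrated.get("autoscaling", {})
    predictor = migrated.setdefault("predictor", {})

    # autoscaling.* -> predictor.* (moved and renamed)
    renames = (
        ("workers_per_replica", "processes_per_replica"),
        ("threads_per_worker", "threads_per_process"),
    )
    for old, new in renames:
        if old in autoscaling:
            predictor[new] = autoscaling.pop(old)

    # endpoint / local_port -> networking.* (assumed to be top-level before)
    for field in ("endpoint", "local_port"):
        if field in migrated:
            migrated.setdefault("networking", {})[field] = migrated.pop(field)

    # predictor.model -> predictor.model_path (assumed location of `model`)
    if "model" in predictor:
        predictor["model_path"] = predictor.pop("model")

    return migrated
```

Fields not mentioned in the breaking changes, such as other autoscaling settings, are carried over as they are.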
Bug fixes
- Prevent GPU overprovisioning during autoscaling https://github.com/cortexlabs/cortex/pull/1111 https://github.com/cortexlabs/cortex/issues/1085 (vishalbollu)
New examples
- tensorflow/image-classifier-resnet50 and pytorch/image-classifier-resnet50 to demonstrate using Inferentia https://github.com/cortexlabs/cortex/pull/1119 (RobertLucian)
- pytorch/multi-model-text-analyzer, tensorflow/multi-model-classifier, and onnx/multi-model-classifier to demonstrate multi-model APIs https://github.com/cortexlabs/cortex/pull/1107 (RobertLucian)
New guides
- Multi-model endpoints using TensorFlow and ONNX predictors https://github.com/cortexlabs/cortex/pull/1107 (RobertLucian)
Docs
- Add API architecture diagram https://github.com/cortexlabs/cortex/pull/1126 (deliahu)
- Add documentation for configuring the CLI on a new machine https://github.com/cortexlabs/cortex/pull/1127 (deliahu, javithe7)
Misc
- Call predictor `__init__()` from the request's threadpool to avoid multithreading issues in some ML frameworks when using 1 thread https://github.com/cortexlabs/cortex/pull/1146 (deliahu)
- Allow an API's endpoint to be changed without triggering a rolling replica update https://github.com/cortexlabs/cortex/pull/1155 https://github.com/cortexlabs/cortex/issues/1116 (deliahu)
- Set the default shell to `bash` in Predictor Dockerfiles (which simplifies using them as base images for custom-built images) https://github.com/cortexlabs/cortex/pull/1104 https://github.com/cortexlabs/cortex/issues/1086 (RobertLucian)
- Move `endpoint` and `local_port` to `networking` API config https://github.com/cortexlabs/cortex/pull/1151 https://github.com/cortexlabs/cortex/issues/1091 (deliahu)
- Rename `model` to `model_path` in API config https://github.com/cortexlabs/cortex/pull/1150 https://github.com/cortexlabs/cortex/issues/1115 (deliahu)
- Use cluster name for CloudWatch metrics namespace https://github.com/cortexlabs/cortex/pull/1138 (deliahu)
- Misc UI improvements https://github.com/cortexlabs/cortex/pull/1159 https://github.com/cortexlabs/cortex/pull/1084 https://github.com/cortexlabs/cortex/pull/1152 https://github.com/cortexlabs/cortex/pull/1136 https://github.com/cortexlabs/cortex/pull/1128 https://github.com/cortexlabs/cortex/commit/a52f0a0a7871253c8fdd13eac7116cf4b2e961c1 https://github.com/cortexlabs/cortex/pull/1096 (deliahu)
Published by deliahu over 5 years ago
cortex - v0.17.1
Bug fixes
- Fix bug which caused installation of packages from `conda-packages.txt` to get stuck when running Cortex locally https://github.com/cortexlabs/cortex/pull/1101 https://github.com/cortexlabs/cortex/issues/1100 (dsuess)
New guides
- Multi-model endpoints using the Python predictor https://github.com/cortexlabs/cortex/pull/1081 (deliahu)
Docs
- Improve tutorial https://github.com/cortexlabs/cortex/pull/1099 https://github.com/cortexlabs/cortex/issues/1098 (ssami)
- Misc docs improvements https://github.com/cortexlabs/cortex/pull/1095 https://github.com/cortexlabs/cortex/commit/e15cc62ddf49af3d41259f56a21adc6e8dabcbc3 https://github.com/cortexlabs/cortex/commit/8023ff98241b85e1d05770c52981721fb8db735f (deliahu, vishalbollu)
Published by deliahu over 5 years ago
cortex - v0.17.0
New features
- Support arbitrary API request payload content types, including raw bytes and form fields (not just JSON) https://github.com/cortexlabs/cortex/pull/1062, https://github.com/cortexlabs/cortex/issues/332, https://github.com/cortexlabs/cortex/issues/917 (deliahu)
- Support custom SSL certificates for the API load balancer https://github.com/cortexlabs/cortex/pull/1069, https://github.com/cortexlabs/cortex/issues/326, https://github.com/cortexlabs/cortex/issues/1066 (vishalbollu)
- Add a cloudwatch dashboard to show metrics for each running API https://github.com/cortexlabs/cortex/pull/1054, https://github.com/cortexlabs/cortex/issues/855 (tthebst)
- Allow for custom tagging of AWS resources created by cortex (and add the `cortex.dev/cluster-name` tag by default) https://github.com/cortexlabs/cortex/pull/1031, https://github.com/cortexlabs/cortex/issues/854, https://github.com/cortexlabs/cortex/issues/856 (vishalbollu)
- Expose request query parameters to the predictor's `predict()` function https://github.com/cortexlabs/cortex/pull/1062, https://github.com/cortexlabs/cortex/issues/546 (deliahu)
- Expose request headers to the predictor's `predict()` function https://github.com/cortexlabs/cortex/pull/1062 (deliahu)
- Allow users to change the Python version via `conda-packages.txt` https://github.com/cortexlabs/cortex/pull/1052, https://github.com/cortexlabs/cortex/issues/1051 (RobertLucian)
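As a rough illustration of the payload, query parameter, and header features listed above, a predictor might look like the sketch below. The class name, the argument names (`payload`, `query_params`, `headers`), the `greeting` config key, and the `x-request-id` header are assumptions made for this example; the notes above only state that these values are exposed to `predict()`.

```python
class PythonPredictor:
    """Hypothetical predictor; names and signature are assumptions, not confirmed API."""

    def __init__(self, config):
        # `greeting` is an illustrative config key, not a Cortex setting.
        self.default_greeting = config.get("greeting", "hello")

    def predict(self, payload, query_params, headers):
        # The payload may arrive as parsed JSON (a dict) or as raw bytes,
        # depending on the request's content type.
        if isinstance(payload, bytes):
            text = payload.decode("utf-8")
        else:
            text = payload["text"]

        # Query parameters and headers are treated as plain mappings here.
        greeting = query_params.get("greeting", self.default_greeting)
        request_id = headers.get("x-request-id", "unknown")
        return {"message": f"{greeting}, {text}", "request_id": request_id}
```

Calling `predict()` directly with plain dictionaries is enough to exercise the logic locally before deploying.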
Bug fixes
- Fix bug which caused a validation error when running TensorFlow or ONNX locally with locally saved models https://github.com/cortexlabs/cortex/pull/1075 (RobertLucian)
- Enable tty on local docker API containers to avoid strange characters showing up in `cortex logs` https://github.com/cortexlabs/cortex/pull/1067 (deliahu)
- Allow cluster's `min_instances` and `max_instances` to be updated at the same time https://github.com/cortexlabs/cortex/pull/1050, https://github.com/cortexlabs/cortex/issues/840 (deliahu)
New examples
- Update pytorch/sentiment-analyzer to use the `transformers` library https://github.com/cortexlabs/cortex/pull/1064 (deliahu)
New guides
- Viewing API metrics on the CloudWatch dashboard (deliahu)
Docs
- Add architecture diagram https://github.com/cortexlabs/cortex/pull/1042, https://github.com/cortexlabs/cortex/issues/1013 (deliahu)
- Document how to install packages from private PyPI indexes https://github.com/cortexlabs/cortex/pull/1072 (RobertLucian)
Misc
- Add zsh completion https://github.com/cortexlabs/cortex/pull/1024, https://github.com/cortexlabs/cortex/issues/1020 (deliahu)
- Rename `cortex cluster update` to `cortex cluster configure` https://github.com/cortexlabs/cortex/pull/1035, https://github.com/cortexlabs/cortex/issues/887 (zouyee)
- Add instance and pricing information to `cortex cluster info` output https://github.com/cortexlabs/cortex/pull/1053, https://github.com/cortexlabs/cortex/issues/835, https://github.com/cortexlabs/cortex/issues/935 (deliahu)
- Rename tracker to monitoring https://github.com/cortexlabs/cortex/pull/1041, https://github.com/cortexlabs/cortex/issues/869 (deliahu)
- Change the default cortex region to us-east-1 https://github.com/cortexlabs/cortex/pull/1063 (deliahu)
- Disable cluster logging https://github.com/cortexlabs/cortex/pull/1029, https://github.com/cortexlabs/cortex/issues/888 (deliahu)
- Add `git` to API images https://github.com/cortexlabs/cortex/pull/1068 (RobertLucian)
- Add upper bound validation for `max_replica_concurrency` https://github.com/cortexlabs/cortex/pull/1025 (zouyee)
- Misc UI/UX improvements https://github.com/cortexlabs/cortex/pull/1032, https://github.com/cortexlabs/cortex/pull/1023, https://github.com/cortexlabs/cortex/pull/1033, https://github.com/cortexlabs/cortex/pull/1036, https://github.com/cortexlabs/cortex/pull/1045, https://github.com/cortexlabs/cortex/pull/1047, https://github.com/cortexlabs/cortex/pull/1049, https://github.com/cortexlabs/cortex/pull/1044, https://github.com/cortexlabs/cortex/commit/93032e2e54c9a1876264c98b724fc29f3faf5a4d, https://github.com/cortexlabs/cortex/pull/1065, https://github.com/cortexlabs/cortex/issues/726, https://github.com/cortexlabs/cortex/issues/1048, https://github.com/cortexlabs/cortex/issues/894, https://github.com/cortexlabs/cortex/issues/1043, https://github.com/cortexlabs/cortex/issues/1028, https://github.com/cortexlabs/cortex/issues/933, https://github.com/cortexlabs/cortex/issues/1027, https://github.com/cortexlabs/cortex/issues/934, https://github.com/cortexlabs/cortex/issues/1026 (deliahu, vishalbollu)
Breaking changes
- `cortex cluster update` has been renamed to `cortex cluster configure` (so it won't be misinterpreted as updating the Cortex version of the cluster)
- the `tracker` field in API configuration has been renamed to `monitoring` (to be consistent with the other field names)
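For the `tracker` to `monitoring` rename, an existing API configuration that has been loaded into Python could be updated along the lines below. This is only a sketch: the helper name is hypothetical, and it assumes each API is a dictionary with `tracker` as a top-level key, which the notes above do not spell out.

```python
import copy


def rename_tracker_to_monitoring(apis):
    """Return a copy of the API configs with `tracker` renamed to `monitoring`.

    Hypothetical helper; assumes `tracker` is a top-level key of each API dict.
    """
    migrated = copy.deepcopy(apis)
    for api in migrated:
        if "tracker" in api:
            if "monitoring" in api:
                # Refuse to guess which of the two blocks should win.
                raise ValueError(
                    f"API {api.get('name')!r} defines both 'tracker' and 'monitoring'"
                )
            api["monitoring"] = api.pop("tracker")
    return migrated
```

The input list is left untouched, so the old and new configurations can be compared before anything is written back to disk.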
Published by deliahu almost 6 years ago
cortex - v0.16.1
Bug fixes
- Read and validate TensorFlow and ONNX models from buckets in any region https://github.com/cortexlabs/cortex/pull/1059 (vishalbollu)
Published by deliahu almost 6 years ago
cortex - v0.16.0
New features
- Support deploying APIs locally https://github.com/cortexlabs/cortex/pull/973 https://github.com/cortexlabs/cortex/issues/109 (vishalbollu)
- Enable private networking: private subnets for instances, internal API load balancer, and internal operator load balancer https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/965 https://github.com/cortexlabs/cortex/issues/832 https://github.com/cortexlabs/cortex/issues/964 (deliahu)
- Support installing system packages via `dependencies.sh` https://github.com/cortexlabs/cortex/pull/880 https://github.com/cortexlabs/cortex/issues/852 (RobertLucian)
- Support installing conda packages via `conda-packages.txt` https://github.com/cortexlabs/cortex/pull/880 https://github.com/cortexlabs/cortex/issues/844 (RobertLucian)
- Allow for spot instances to be used with a single instance type https://github.com/cortexlabs/cortex/pull/979 https://github.com/cortexlabs/cortex/issues/886 (RobertLucian)
- Support specifying serving images in API configuration (on a per-API basis) https://github.com/cortexlabs/cortex/pull/948 https://github.com/cortexlabs/cortex/issues/900 (RobertLucian)
- Add cortex commands to list and remove CLI environments https://github.com/cortexlabs/cortex/pull/973 https://github.com/cortexlabs/cortex/issues/730 (deliahu)
- Support `bytes` and `starlette.responses.Response` response types from `predict()` https://github.com/cortexlabs/cortex/pull/915 https://github.com/cortexlabs/cortex/issues/913 (RobertLucian)
- Add slim predictor base images https://github.com/cortexlabs/cortex/pull/992 https://github.com/cortexlabs/cortex/issues/781 (deliahu)
- Support configuring instance volume type and provisioned IOPS https://github.com/cortexlabs/cortex/pull/982 https://github.com/cortexlabs/cortex/issues/592 (tthebst)
- Support highly available NAT Gateway https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/963 (deliahu)
- Add --yes flag to skip prompts on cluster CLI commands https://github.com/cortexlabs/cortex/pull/980 https://github.com/cortexlabs/cortex/issues/929 (deliahu)
New Examples
- Bart summarizer (using PyTorch) https://github.com/cortexlabs/cortex/pull/907 (ismaelc)
- Named entity recognizer (using spacy) https://github.com/cortexlabs/cortex/pull/924 (aced125)
- Lite version of the license plate reader https://github.com/cortexlabs/cortex/pull/994 (RobertLucian)
New Guides
- Set up AWS API gateway (deliahu)
- Plot response code counts (deliahu)
- Plot API request time (deliahu)
- Plot in-flight requests (deliahu)
- Set up VPC peering (deliahu)
- SSH into AWS instance (RobertLucian)
Docs
Misc
- Use rolling updates for daemonsets when running `cortex cluster update` https://github.com/cortexlabs/cortex/pull/972 https://github.com/cortexlabs/cortex/issues/630 (tthebst)
- Switch from Classic Elastic Load Balancer to Network Load Balancer https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/966 (deliahu)
- Show the original error message when encountering "invalid AWS credentials" https://github.com/cortexlabs/cortex/pull/918 (deliahu)
- Log unexpected TensorFlow Serving gRPC errors https://github.com/cortexlabs/cortex/pull/949 (deliahu)
Breaking Changes
- Previously, custom serving images were configured in the cluster configuration file (e.g. `cluster.yaml` would have a line for `image_python_serve: my-repo/python-serve:latest`). Now, custom images are specified inside the API configuration (e.g. `cortex.yaml` should have `image: my-repo/python-serve:latest` in the `predictor` section of your API configuration). Here's the full documentation for API configuration.
- The names of the serving base images have been updated to be more descriptive, and "slim" images have been added (they are more appropriate to use as base images when building custom images). Here is the full documentation for custom Docker images.
- The `debug` query parameter to APIs (which caused the input and output to `predict()` to be logged) has been removed (https://github.com/cortexlabs/cortex/pull/985)
Published by deliahu almost 6 years ago
cortex - v0.15.1
Bug fixes
- Fix admin check for root AWS users https://github.com/cortexlabs/cortex/pull/940 (deliahu)
- Fix CORS https://github.com/cortexlabs/cortex/pull/942 (vishalbollu)
Misc
- Print panic stack traces https://github.com/cortexlabs/cortex/pull/921 (deliahu)
Published by deliahu almost 6 years ago
cortex - v0.15.0
Bug fixes
- Improve availability zone selection and validation https://github.com/cortexlabs/cortex/pull/885 https://github.com/cortexlabs/cortex/pull/891 (deliahu)
- Validate predictor implementation using `getfullargspec()` https://github.com/cortexlabs/cortex/pull/902 (vishalbollu)
- Do not remove cluster configuration cache if refresh fails https://github.com/cortexlabs/cortex/pull/893 (vishalbollu)
New Examples
- Keras autoencoder for filtering out the noise from text documents https://github.com/cortexlabs/cortex/pull/834 (RobertLucian)
Misc
- Check cluster status before executing cluster commands https://github.com/cortexlabs/cortex/pull/881 https://github.com/cortexlabs/cortex/pull/879 https://github.com/cortexlabs/cortex/pull/892 (vishalbollu)
- Check for AWS Administrator IAM access in `cluster up` and `cluster down` commands https://github.com/cortexlabs/cortex/pull/878 (deliahu)
- Wait for cloudformation stacks to delete during `cluster down` https://github.com/cortexlabs/cortex/pull/876 (vishalbollu)
- Verify cortex operator url during `cortex configure` https://github.com/cortexlabs/cortex/pull/877 (vishalbollu)
- Allow blank bucket values in `cluster.yaml` https://github.com/cortexlabs/cortex/pull/875 (vishalbollu)
- Improve various error messages https://github.com/cortexlabs/cortex/pull/895 https://github.com/cortexlabs/cortex/pull/896 https://github.com/cortexlabs/cortex/pull/897 https://github.com/cortexlabs/cortex/pull/899 https://github.com/cortexlabs/cortex/pull/905 (deliahu)
- Improve documentation https://github.com/cortexlabs/cortex/pull/861 https://github.com/cortexlabs/cortex/pull/853 https://github.com/cortexlabs/cortex/pull/851 https://github.com/cortexlabs/cortex/pull/868 https://github.com/cortexlabs/cortex/pull/870 https://github.com/cortexlabs/cortex/pull/871, https://github.com/cortexlabs/cortex/pull/872, https://github.com/cortexlabs/cortex/commit/ac481b93e2254d4753201909fdd8063df3484661 https://github.com/cortexlabs/cortex/commit/3ad39039ca366e240a83779845bae6d3f0d4b853 (deliahu, vishalbollu, RobertLucian)
Breaking changes
- Remove `json_tricks` for encoding API responses (responses from `predict()` must now be json serializable) https://github.com/cortexlabs/cortex/pull/908 (vishalbollu)
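Since responses from `predict()` must now be JSON serializable, one possible approach is to normalize the return value before handing it back. The helper below is a sketch using only the standard library; `to_json_safe` is a hypothetical name, and the set of handled types is an assumption about what a predictor might return.

```python
import json
from datetime import datetime


def to_json_safe(value):
    """Recursively convert common non-JSON types into JSON-serializable ones."""
    if isinstance(value, dict):
        return {str(key): to_json_safe(item) for key, item in value.items()}
    if isinstance(value, (list, tuple, set)):
        return [to_json_safe(item) for item in value]
    if isinstance(value, datetime):
        return value.isoformat()
    if isinstance(value, bytes):
        return value.decode("utf-8", errors="replace")
    if hasattr(value, "tolist"):
        # Covers array-like objects such as NumPy arrays and scalars.
        return to_json_safe(value.tolist())
    return value


# Example: a prediction containing a tuple, a datetime, and raw bytes.
prediction = {"scores": (0.1, 0.9), "when": datetime(2020, 3, 1), "raw": b"ok"}
encoded = json.dumps(to_json_safe(prediction))
```

Returning `to_json_safe(result)` from `predict()` keeps the conversion in one place rather than scattering `.tolist()` calls through the predictor.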
Published by deliahu almost 6 years ago
cortex - v0.14.1
Bug fixes
* Fix JSON parsing before it gets passed to predict() https://github.com/cortexlabs/cortex/pull/865 (vishalbollu)
* Support setup.py packages in requirements.txt https://github.com/cortexlabs/cortex/pull/864 (deliahu)
* Run TensorFlow Predictor's model validations in the region that contains the bucket https://github.com/cortexlabs/cortex/pull/866 (deliahu)
Misc
* Check that cluster image versions match the CLI version https://github.com/cortexlabs/cortex/pull/862 (deliahu)
* Add Cython to Python predictor image https://github.com/cortexlabs/cortex/commit/4a463cab809e67ae257b2ac5a41ee94a7cdbce93 (deliahu)
Published by vishalbollu almost 6 years ago
cortex - v0.14.0
New features
- Support request-based autoscaling https://github.com/cortexlabs/cortex/pull/815 https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/573 (vishalbollu, deliahu)
- Support fine-grained configuration for autoscaling algorithm behavior https://github.com/cortexlabs/cortex/pull/815 (deliahu)
- Support configurable in-replica parallelism (i.e. workers, threads) https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/590 (vishalbollu, deliahu)
- Support configurable request queue length https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/646 (vishalbollu)
- Support .cortexignore file to exclude files/directories from Cortex project zip https://github.com/cortexlabs/cortex/pull/800 https://github.com/cortexlabs/cortex/issues/723 (wingkwong)
Bug fixes
- Ensure previous logs are never shown after showing newer ones https://github.com/cortexlabs/cortex/pull/792 (deliahu)
- Skip service quota validation in unsupported regions https://github.com/cortexlabs/cortex/pull/825 (deliahu)
- Fix prediction metrics when specifying tracker.key https://github.com/cortexlabs/cortex/pull/793 (deliahu)
New Examples
- Real-Time License Plate Detector Example Project (YOLOv3, CRAFT, CRNN) https://github.com/cortexlabs/cortex/pull/803 (RobertLucian)
Misc
- Show a warning if AWS session token is detected https://github.com/cortexlabs/cortex/pull/842 (chrisranderson, vishalbollu)
- Disable NAT gateway https://github.com/cortexlabs/cortex/pull/808 (deliahu)
- Add debug information to cluster error messages https://github.com/cortexlabs/cortex/commit/5a1a2bcba5d5634eb3fd8fd07c64f9556df386e0 https://github.com/cortexlabs/cortex/pull/850 (deliahu)
- Add cluster costs to README https://github.com/cortexlabs/cortex/pull/807 https://github.com/cortexlabs/cortex/issues/806 (bcjordan, deliahu)
- Document which system packages are installed in Docker images https://github.com/cortexlabs/cortex/pull/847 https://github.com/cortexlabs/cortex/issues/822 (deliahu)
- Update pytorch examples to use GPU https://github.com/cortexlabs/cortex/pull/849 (vishalbollu)
- Install `libsndfile1` in API Docker images https://github.com/cortexlabs/cortex/pull/826 (deliahu)
Breaking API Changes
- `min_replicas`, `max_replicas`, and `init_replicas` have been moved from the `compute` configuration key to `autoscaling`
- `max_surge` and `max_unavailable` have been moved from the `compute` configuration key to `update_strategy`
- `target_cpu_utilization` has been removed in favor of the request-based scaling configuration parameters (see the autoscaling docs for a detailed explanation of the new parameters)
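The key moves described above can be sketched as a small migration over an API configuration loaded as a dictionary. The helper name is hypothetical, and the sketch assumes `autoscaling` and `update_strategy` sit alongside `compute` at the top level of each API, which the notes above imply but do not state outright.

```python
import copy

# Keys listed in the breaking changes above, grouped by their new parent key.
AUTOSCALING_KEYS = ("min_replicas", "max_replicas", "init_replicas")
UPDATE_STRATEGY_KEYS = ("max_surge", "max_unavailable")


def migrate_api_config(api):
    """Return (migrated_config, dropped_cpu_target) without mutating the input.

    Hypothetical helper; assumes the new keys are siblings of `compute`.
    """
    migrated = copy.deepcopy(api)
    compute = migrated.get("compute", {})

    for key in AUTOSCALING_KEYS:
        if key in compute:
            migrated.setdefault("autoscaling", {})[key] = compute.pop(key)

    for key in UPDATE_STRATEGY_KEYS:
        if key in compute:
            migrated.setdefault("update_strategy", {})[key] = compute.pop(key)

    # target_cpu_utilization has no direct replacement, so it is only dropped
    # and reported; the request-based parameters must be chosen by hand.
    dropped = compute.pop("target_cpu_utilization", None) is not None
    return migrated, dropped
```

The boolean flag is there so a caller can warn that a CPU target was removed and that the request-based autoscaling parameters still need to be set.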
Published by deliahu almost 6 years ago
cortex - v0.13.1
Bug fixes
- Fix `cortex cluster update` when using spot instances with no on-demand backup nodegroup https://github.com/cortexlabs/cortex/pull/787 (vishalbollu)
Misc
- Set locale to en_US.UTF-8 https://github.com/cortexlabs/cortex/pull/784 (deliahu, RobertLucian)
- Cause replica to error when pip install fails https://github.com/cortexlabs/cortex/commit/394862b0f9ce303f71ce9d18996c13350d47e2c5 (deliahu)
- Query EKS price from AWS Pricing API https://github.com/cortexlabs/cortex/pull/783 (deliahu)
- Assert API version before inspecting args in python https://github.com/cortexlabs/cortex/pull/789 (vishalbollu)
- Improve healthcheck https://github.com/cortexlabs/cortex/pull/788 (vishalbollu)
Published by deliahu about 6 years ago
cortex - v0.13.0
New features
- Support on-demand instance backup when spot instances are not available https://github.com/cortexlabs/cortex/pull/745 https://github.com/cortexlabs/cortex/issues/629 (vishalbollu)
- Remove `kind: deployment` from API configuration https://github.com/cortexlabs/cortex/pull/759 (deliahu, vishalbollu)
- Add `cortex refresh <api_name>` command https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/758 (deliahu)
- Update `cortex delete <api_name>` command https://github.com/cortexlabs/cortex/pull/759 (deliahu)
- Add configuration for rolling update strategy (`max_surge` and `max_unavailable`) https://github.com/cortexlabs/cortex/pull/763 (deliahu)
- Support programmatic CLI configuration via command line flags https://github.com/cortexlabs/cortex/pull/764 https://github.com/cortexlabs/cortex/issues/729 (deliahu)
- Support small instance types https://github.com/cortexlabs/cortex/pull/720 (deliahu)
- Add env flag to `cluster up` and `cluster update` commands https://github.com/cortexlabs/cortex/pull/731 (deliahu)
Bug fixes
- Limit cluster growth rate to avoid Kubernetes API server crashes https://github.com/cortexlabs/cortex/pull/769 (vishalbollu)
- Use configured max price for filtering spot instance distribution https://github.com/cortexlabs/cortex/pull/746 https://github.com/cortexlabs/cortex/issues/719 (vishalbollu)
- Disallow nano and micro instances https://github.com/cortexlabs/cortex/commit/84f09375a7eb9d817e7be40e246cb190b9eb8393 https://github.com/cortexlabs/cortex/issues/755 (deliahu)
- Fix pod status calculation to classify successfully recovered replicas as ready https://github.com/cortexlabs/cortex/commit/c5d97eb2156592ebaa73c3481404bf4a496dbcf1 https://github.com/cortexlabs/cortex/issues/738 (deliahu)
New Examples
- Object detection in images with R-CNN https://github.com/cortexlabs/cortex/pull/754 (ArkinDharawat)
- Fastai https://github.com/cortexlabs/cortex/pull/725 (caleb-kaiser)
Misc
- Add total cluster price to installation confirmation message https://github.com/cortexlabs/cortex/pull/714 https://github.com/cortexlabs/cortex/pull/775 https://github.com/cortexlabs/cortex/issues/713 (deliahu, vishalbollu)
- Prompt before attempting to zip large files, many files, or large total folder size https://github.com/cortexlabs/cortex/pull/752 https://github.com/cortexlabs/cortex/pull/767 https://github.com/cortexlabs/cortex/issues/721 https://github.com/cortexlabs/cortex/issues/722 (vishalbollu, deliahu)
- Reduce cortex operator kubernetes API calls https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/672 (deliahu)
- Reduce fluentd kubernetes API calls https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/672 (vishalbollu)
- Add EKS control plane logging https://github.com/cortexlabs/cortex/pull/753 https://github.com/cortexlabs/cortex/issues/717 (vishalbollu)
- Enforce that bucket and cluster regions match https://github.com/cortexlabs/cortex/pull/777 (deliahu)
- Update ONNX runtime to 1.1.0 https://github.com/cortexlabs/cortex/commit/1e74ab7168534183710ab1f6dfe96666e08fb323 https://github.com/cortexlabs/cortex/commit/64f95b7598986a6937b16f0c21d820de4d353221 https://github.com/cortexlabs/cortex/issues/571 (deliahu)
- Direct users to check auto scaling group activity history if cluster up fails https://github.com/cortexlabs/cortex/pull/757 https://github.com/cortexlabs/cortex/issues/740 (vishalbollu)
- Pre-install opencv system packages https://github.com/cortexlabs/cortex/pull/772 (vishalbollu)
- Improve config validations https://github.com/cortexlabs/cortex/pull/751 https://github.com/cortexlabs/cortex/commit/c0a89a2224f98884cba2af53e8a108b7c32bf842 https://github.com/cortexlabs/cortex/issues/732 https://github.com/cortexlabs/cortex/issues/742 (deliahu, vishalbollu)
- Increase metrics server memory request/limit https://github.com/cortexlabs/cortex/commit/60f00b079f7b87321f0c9d87454c02e6ec25e8e2#diff-d62cba9784a96fc0a7471ca4d8b38e96 https://github.com/cortexlabs/cortex/issues/748 (deliahu)
- Disable operator autoscaling https://github.com/cortexlabs/cortex/pull/743 (vishalbollu)
- Add `kubectl top` to `cortex cluster info --debug` output https://github.com/cortexlabs/cortex/pull/756 https://github.com/cortexlabs/cortex/issues/716 (vishalbollu)
Published by deliahu about 6 years ago
cortex - v0.12.0
New features
- Support new instance types (e.g. g3 and g4 instances) https://github.com/cortexlabs/cortex/pull/655 (deliahu)
- Support batched TensorFlow and ONNX predictions https://github.com/cortexlabs/cortex/pull/666 https://github.com/cortexlabs/cortex/issues/562 (vishalbollu)
- Allow users to configure availability zones https://github.com/cortexlabs/cortex/pull/681 https://github.com/cortexlabs/cortex/issues/677 (vishalbollu)
- Support multiple cortex clusters in the same region https://github.com/cortexlabs/cortex/pull/661 https://github.com/cortexlabs/cortex/pull/664 https://github.com/cortexlabs/cortex/issues/660 (deliahu)
- Add AWS resource pricing to `cortex cluster up` confirmation message https://github.com/cortexlabs/cortex/pull/647 https://github.com/cortexlabs/cortex/pull/690 https://github.com/cortexlabs/cortex/issues/641 (deliahu)
- Autofill instance distribution based on spot price https://github.com/cortexlabs/cortex/pull/670 https://github.com/cortexlabs/cortex/issues/603 (vishalbollu)
- Add support for passing environment variables through to containers https://github.com/cortexlabs/cortex/pull/694 https://github.com/cortexlabs/cortex/issues/688 (vishalbollu)
Bug fixes
- Surface operator connection error messages in CLI https://github.com/cortexlabs/cortex/pull/659 https://github.com/cortexlabs/cortex/issues/658 (deliahu)
- Fix occasional logs stream errors https://github.com/cortexlabs/cortex/pull/689 (vishalbollu)
- Install pip packages with --no-cache-dir https://github.com/cortexlabs/cortex/pull/623 (vishalbollu)
Misc
- Convert predictor APIs into Python classes https://github.com/cortexlabs/cortex/pull/636 https://github.com/cortexlabs/cortex/pull/666 https://github.com/cortexlabs/cortex/issues/589 (vishalbollu)
- Rename sample to payload in Python APIs https://github.com/cortexlabs/cortex/pull/626 (vishalbollu)
- Confirm before deleting a deployment https://github.com/cortexlabs/cortex/pull/692 https://github.com/cortexlabs/cortex/issues/674 (vishalbollu)
- Check for unsupported instance types https://github.com/cortexlabs/cortex/commit/952a1f7af4a12c6560f2968961bfe4a7514c472f (deliahu)
- Check user EC2 limits before spinning up instances https://github.com/cortexlabs/cortex/pull/638 https://github.com/cortexlabs/cortex/pull/653 https://github.com/cortexlabs/cortex/issues/584 (vishalbollu, deliahu)
- Add `cortex cluster info --debug` command https://github.com/cortexlabs/cortex/pull/691 https://github.com/cortexlabs/cortex/issues/657 (deliahu)
- Remove upper limit on CPU target utilization https://github.com/cortexlabs/cortex/pull/635 (deliahu)
- Improve cortex deploy response message https://github.com/cortexlabs/cortex/pull/650 https://github.com/cortexlabs/cortex/issues/642 (deliahu)
- Improve API status output https://github.com/cortexlabs/cortex/pull/656 https://github.com/cortexlabs/cortex/issues/652 (deliahu)
- Improve spot config documentation https://github.com/cortexlabs/cortex/pull/670 https://github.com/cortexlabs/cortex/issues/627 (vishalbollu)
- Pre-download Docker images on cluster installation https://github.com/cortexlabs/cortex/pull/662 https://github.com/cortexlabs/cortex/issues/569 (deliahu)
- Remove `cortex support` command https://github.com/cortexlabs/cortex/pull/683 https://github.com/cortexlabs/cortex/issues/668 (vishalbollu)
- Stream logs from all pods to CloudWatch https://github.com/cortexlabs/cortex/pull/671 https://github.com/cortexlabs/cortex/issues/586 (vishalbollu)
- Support running `cortex deploy` from subdirectories https://github.com/cortexlabs/cortex/pull/675 https://github.com/cortexlabs/cortex/issues/673 (deliahu)
- Set log group and bucket name defaults to cluster name https://github.com/cortexlabs/cortex/pull/693 https://github.com/cortexlabs/cortex/issues/680 (vishalbollu)
Published by deliahu about 6 years ago
cortex - v0.11.1
Bug fixes
- Update healthcheck to use exec instead of using http healthz https://github.com/cortexlabs/cortex/pull/648 (vishalbollu)
Misc
- Allow configuring waitress parameters via API metadata https://github.com/cortexlabs/cortex/commit/5529873c05ba6cc28896d0b2167891794313f924 https://github.com/cortexlabs/cortex/commit/a354308512929584c90592170baf66583dd85bca (deliahu)
Published by deliahu about 6 years ago
cortex - v0.11.0
New features
- Support spot instances https://github.com/cortexlabs/cortex/pull/585 https://github.com/cortexlabs/cortex/pull/597 https://github.com/cortexlabs/cortex/issues/469 (vishalbollu)
Examples
- Add MLflow example https://github.com/cortexlabs/cortex/pull/566 https://github.com/cortexlabs/cortex/issues/553 (ospillinger)
- Add language identification example (fastText) https://github.com/cortexlabs/cortex/commit/0173bc432a748c59ea6d9bd8419d58805ac81c56 (ospillinger)
- Add answer generation example https://github.com/cortexlabs/cortex/pull/580 (ospillinger)
- Add reading comprehension example https://github.com/cortexlabs/cortex/pull/581 (ospillinger)
- Add text summarization example https://github.com/cortexlabs/cortex/commit/be42b7c1ec707b92e36dd95123615b17a9a26ff0 (ospillinger)
Misc
- Create separate nodegroup for Cortex operator containers https://github.com/cortexlabs/cortex/pull/577 https://github.com/cortexlabs/cortex/issues/500 (vishalbollu)
- Improve API logging https://github.com/cortexlabs/cortex/pull/596 https://github.com/cortexlabs/cortex/issues/587 (deliahu)
- Improve CLI output https://github.com/cortexlabs/cortex/pull/570 https://github.com/cortexlabs/cortex/issues/567 https://github.com/cortexlabs/cortex/issues/568 https://github.com/cortexlabs/cortex/issues/574 (deliahu)
- Update API info endpoint route and response https://github.com/cortexlabs/cortex/pull/594 https://github.com/cortexlabs/cortex/issues/593 (deliahu)
- Add quickstart / tutorial https://github.com/cortexlabs/cortex/pull/595 (ospillinger)
- Create CONTRIBUTING.md https://github.com/cortexlabs/cortex/pull/555 https://github.com/cortexlabs/cortex/issues/310 (ospillinger)
Published by vishalbollu about 6 years ago
cortex - v0.10.2
Bug fixes:
- Query node group min and max size from AWS AutoScalingGroup API https://github.com/cortexlabs/cortex/pull/576 https://github.com/cortexlabs/cortex/issues/575 (deliahu)
- Route API requests on exact URL matches, not prefix matches https://github.com/cortexlabs/cortex/commit/bb22971d79bad6f728b3a9716b511795208f65f9 (deliahu)
Published by deliahu over 6 years ago
cortex - v0.10.1
Bug fixes:
- Refresh logger after loading user modules https://github.com/cortexlabs/cortex/pull/563 (vishalbollu)
- Remove extra parameters that may be sent to JSON tricks encoder initialization https://github.com/cortexlabs/cortex/pull/565 (vishalbollu)
- Set all TensorFlow version directory names to "1" https://github.com/cortexlabs/cortex/pull/560 https://github.com/cortexlabs/cortex/issues/354 (deliahu)
- Convert TensorFlow model prefix to a directory https://github.com/cortexlabs/cortex/commit/10b62b43edb436c3787bf0138203e04429494478 (deliahu)
Misc
- Show CLI version regardless of cluster status https://github.com/cortexlabs/cortex/commit/0e218ab1e8cc9aaef1a95b9c30a0d09327151516 (deliahu)
- Check EKS status before spinning up cluster https://github.com/cortexlabs/cortex/pull/561 (deliahu)
- Add `git` to API images https://github.com/cortexlabs/cortex/pull/559 (vishalbollu)
Published by deliahu over 6 years ago
cortex - v0.10.0
New features
- Add Predictor model format to support deploying generic Python models https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/pull/549 https://github.com/cortexlabs/cortex/issues/506 (vishalbollu)
- Ship with popular Python packages out of the box https://github.com/cortexlabs/cortex/pull/550 (vishalbollu)
- Add metadata field to API configuration https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/pull/544 https://github.com/cortexlabs/cortex/issues/522 (vishalbollu)
- Support custom API endpoints https://github.com/cortexlabs/cortex/pull/534 https://github.com/cortexlabs/cortex/issues/531 https://github.com/cortexlabs/cortex/issues/223 (deliahu)
- Move cluster installation and management commands to the CLI https://github.com/cortexlabs/cortex/pull/527 https://github.com/cortexlabs/cortex/issues/501 https://github.com/cortexlabs/cortex/issues/499 https://github.com/cortexlabs/cortex/issues/515 https://github.com/cortexlabs/cortex/issues/518 https://github.com/cortexlabs/cortex/issues/526 (deliahu)
Examples
- Add an example with Hugging Face's DistilGPT2 text generation model (PyTorch) https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/issues/507 (vishalbollu)
- Add an example for predicting miles per gallon to demonstrate regression (scikit-learn) https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/issues/497 (vishalbollu)
Published by deliahu over 6 years ago
cortex -
Bug fixes
- Fix bug in multi-input ONNX models https://github.com/cortexlabs/cortex/commit/a6bdb5f74342c2bfb60bb2ca1d4cd6f05006853a (vishalbollu)
- Don't update API metrics on non-POST requests https://github.com/cortexlabs/cortex/commit/f1bc223d7b6280b458b5296957d7dc0332155498 (deliahu)
Misc
- Add CORS headers https://github.com/cortexlabs/cortex/pull/540 (deliahu)
Published by deliahu over 6 years ago
cortex -
New features
- Add Cortex Python client https://github.com/cortexlabs/cortex/pull/488 https://github.com/cortexlabs/cortex/issues/467 (vishalbollu)
- Add Cortex support CLI command https://github.com/cortexlabs/cortex/pull/491 https://github.com/cortexlabs/cortex/issues/336 (vishalbollu)
- Add configure --print CLI command https://github.com/cortexlabs/cortex/commit/52ceae32b67d04d7ca6f6377c24d9aa5cdfa84db (deliahu)
Bug fixes:
- Prevent load balancer from timing out requests https://github.com/cortexlabs/cortex/pull/490 https://github.com/cortexlabs/cortex/commit/adcf18cce3901f17b8bcd4a7b87e0c532a4744e5 https://github.com/cortexlabs/cortex/issues/487 (vishalbollu)
- Remove unnecessary lock in operator init https://github.com/cortexlabs/cortex/commit/411bac6872b01306e3b9a5765db5edca5f3f0032 (deliahu)
- Silence stale API saved status not found errors https://github.com/cortexlabs/cortex/commit/aeac492b8b01a73bf4c986f4ddffa88422a593df (deliahu)
- Remove availability zone configuration https://github.com/cortexlabs/cortex/commit/2e8913ba5a21fa86494aaee8e45ae0ee1c976b08 https://github.com/cortexlabs/cortex/issues/494 (deliahu)
- Show correct URL upon failed HTTP request from CLI https://github.com/cortexlabs/cortex/pull/504 (vishalbollu)
Examples
- Shorten gpt-2 model output length https://github.com/cortexlabs/cortex/commit/1a56661bee2645ed12ffb9f232770c40c7ec8473 (deliahu)
Misc
- Validate access to cortex bucket on deploy https://github.com/cortexlabs/cortex/pull/511 https://github.com/cortexlabs/cortex/issues/502 (deliahu)
- Remove cortex namespace configuration option https://github.com/cortexlabs/cortex/commit/abdc0cd241d6a62b382039392afec5427ca77463 (deliahu)
Published by deliahu over 6 years ago
cortex -
New features
- Add prediction response tracking https://github.com/cortexlabs/cortex/pull/322 https://github.com/cortexlabs/cortex/pull/360 https://github.com/cortexlabs/cortex/pull/378 https://github.com/cortexlabs/cortex/pull/419 https://github.com/cortexlabs/cortex/pull/481 https://github.com/cortexlabs/cortex/commit/81718b4c9a750e685bebb52e79204fea10933ab4 https://github.com/cortexlabs/cortex/issues/225 (vishalbollu)
- Add networking metrics (latency, error codes) https://github.com/cortexlabs/cortex/pull/278 https://github.com/cortexlabs/cortex/pull/420 https://github.com/cortexlabs/cortex/pull/475 https://github.com/cortexlabs/cortex/pull/472 https://github.com/cortexlabs/cortex/issues/187 (vishalbollu, 1vn)
- Support importing local python files in handlers https://github.com/cortexlabs/cortex/pull/398 https://github.com/cortexlabs/cortex/pull/452 (1vn, vishalbollu)
- Support TensorFlow model directories on S3 https://github.com/cortexlabs/cortex/pull/323 https://github.com/cortexlabs/cortex/pull/373 https://github.com/cortexlabs/cortex/issues/215 https://github.com/cortexlabs/cortex/issues/366 (1vn)
- Support user-specified TensorFlow signature def keys https://github.com/cortexlabs/cortex/pull/365 https://github.com/cortexlabs/cortex/pull/471 https://github.com/cortexlabs/cortex/pull/459 https://github.com/cortexlabs/cortex/issues/343 (1vn, vishalbollu, deliahu)
- Improve signature def detection https://github.com/cortexlabs/cortex/pull/460 https://github.com/cortexlabs/cortex/commit/28dc9892f057a9141a8f6ae5385470ecbc4b4119 https://github.com/cortexlabs/cortex/issues/451 (vishalbollu)
- Add debug mode to API requests https://github.com/cortexlabs/cortex/pull/369 https://github.com/cortexlabs/cortex/issues/328 (1vn)
- Support print statements in handlers https://github.com/cortexlabs/cortex/pull/406 https://github.com/cortexlabs/cortex/pull/377 https://github.com/cortexlabs/cortex/issues/339 (vishalbollu, 1vn)
- Automatically configure operator URL when installing Cortex https://github.com/cortexlabs/cortex/pull/401 https://github.com/cortexlabs/cortex/issues/334 (1vn)
Bug fixes:
- Evict pods that consume too much memory https://github.com/cortexlabs/cortex/pull/426 https://github.com/cortexlabs/cortex/issues/424 (deliahu)
- Show logs from init containers https://github.com/cortexlabs/cortex/pull/393 https://github.com/cortexlabs/cortex/issues/324 (vishalbollu)
- Support "None" dims in model signatures https://github.com/cortexlabs/cortex/pull/465 (deliahu)
- Fix line wrapping with CLI --watch flag https://github.com/cortexlabs/cortex/commit/b4f72570108bd17a07ba7345c24b34a87ec11d4b (deliahu)
Examples
- Convert example model code to notebooks https://github.com/cortexlabs/cortex/pull/480 (deliahu)
- Add GPT-2 text generation example https://github.com/cortexlabs/cortex/pull/353 (1vn)
- Add BERT sentiment analysis example https://github.com/cortexlabs/cortex/pull/295 (1vn)
- Add AlexNet PyTorch example https://github.com/cortexlabs/cortex/pull/477 (vishalbollu)
- Add ImageNet Inception example https://github.com/cortexlabs/cortex/pull/344 https://github.com/cortexlabs/cortex/issues/318 (1vn)
- Add normalization to iris sklearn example https://github.com/cortexlabs/cortex/pull/337 (deliahu)
Misc
- Remove sample key from prediction API and prediction key from prediction response https://github.com/cortexlabs/cortex/pull/399 https://github.com/cortexlabs/cortex/issues/389 (vishalbollu)
- Remove response key from TensorFlow prediction response https://github.com/cortexlabs/cortex/pull/478 (vishalbollu)
- Pass ONNX model output directly to post_inference request handler https://github.com/cortexlabs/cortex/pull/476 (vishalbollu)
- Use HTTP endpoints by default https://github.com/cortexlabs/cortex/pull/350 https://github.com/cortexlabs/cortex/issues/327 (1vn)
- Remove verbose flag from logs command https://github.com/cortexlabs/cortex/pull/400 https://github.com/cortexlabs/cortex/issues/391 (vishalbollu)
- Add operator AWS credentials https://github.com/cortexlabs/cortex/pull/349 (ospillinger)
- Replace non-ready APIs without rolling update https://github.com/cortexlabs/cortex/pull/448 https://github.com/cortexlabs/cortex/issues/407 (deliahu)
- Autocast numpy objects to appropriate type https://github.com/cortexlabs/cortex/pull/384 https://github.com/cortexlabs/cortex/issues/338 (vishalbollu)
- Add out-of-memory error https://github.com/cortexlabs/cortex/pull/418 https://github.com/cortexlabs/cortex/issues/372 (deliahu)
- Add more fine-grained status messages https://github.com/cortexlabs/cortex/pull/440 https://github.com/cortexlabs/cortex/issues/408 (deliahu)
- Don't require --force once min replicas are met https://github.com/cortexlabs/cortex/pull/449 https://github.com/cortexlabs/cortex/issues/359 (deliahu)
- Validate that requested resources can fit in a node before deploying https://github.com/cortexlabs/cortex/pull/379 https://github.com/cortexlabs/cortex/issues/306 (1vn)
- Validate that request handlers exist before deploying https://github.com/cortexlabs/cortex/pull/438 https://github.com/cortexlabs/cortex/issues/427 https://github.com/cortexlabs/cortex/issues/428 (vishalbollu)
- Enforce zip file size limit https://github.com/cortexlabs/cortex/pull/457 https://github.com/cortexlabs/cortex/issues/437 (vishalbollu)
- Support numeric CPU values in API configuration https://github.com/cortexlabs/cortex/pull/413 https://github.com/cortexlabs/cortex/issues/395 (deliahu)
- Improve handler loading error messages https://github.com/cortexlabs/cortex/pull/382 https://github.com/cortexlabs/cortex/pull/352 https://github.com/cortexlabs/cortex/pull/479 https://github.com/cortexlabs/cortex/issues/292 https://github.com/cortexlabs/cortex/issues/341 (vishalbollu)
- Improve cortex.sh configuration and logging https://github.com/cortexlabs/cortex/commit/d4e77383e5eb5485134b95d2da10bd45d44ecd77 https://github.com/cortexlabs/cortex/commit/577b31e5f13f370679673c4aa7e0ce2da1e58196 https://github.com/cortexlabs/cortex/commit/5a96fa706a93bcf5cbcd65092679f3fc23ec1949 https://github.com/cortexlabs/cortex/commit/08fde5b83e350934dadb9a8dbd6414c52f32a07a https://github.com/cortexlabs/cortex/commit/994a49bfedf677ad5842561ceb8e93882efba871 https://github.com/cortexlabs/cortex/commit/4c196a6e7bc86090d731d8380afc41f7ebac1ac5 (deliahu, ospillinger)
- Add timestamp to logs https://github.com/cortexlabs/cortex/pull/402 https://github.com/cortexlabs/cortex/issues/390 (vishalbollu)
- Only read deployment configuration from cortex.yaml https://github.com/cortexlabs/cortex/pull/396 https://github.com/cortexlabs/cortex/issues/387 (deliahu)
- Rename default environment to "default" https://github.com/cortexlabs/cortex/commit/6051dcdeb4017c1e44bc6abbf2924c9ccf65f0a5 (deliahu)
- Set Python version to 3.6 https://github.com/cortexlabs/cortex/pull/461 (deliahu)
- Update TensorFlow version to 1.14 https://github.com/cortexlabs/cortex/commit/ba0b541fdfa1492215beea18209cf632df2c9234 (deliahu)
- Use Istio for networking https://github.com/cortexlabs/cortex/pull/237 https://github.com/cortexlabs/cortex/pull/374 https://github.com/cortexlabs/cortex/issues/201 (1vn, deliahu)
- Stream logs from CloudWatch https://github.com/cortexlabs/cortex/pull/447 https://github.com/cortexlabs/cortex/pull/466 (vishalbollu)
Published by deliahu over 6 years ago
cortex -
Bug fixes:
- Update TensorFlow create_raw_prediction_request https://github.com/cortexlabs/cortex/commit/55f9916bc4caeb83ede891c0ee575be345910aac https://github.com/cortexlabs/cortex/commit/bcc54016867054dcf13fce6e19fed0f0b63c7590 (1vn)
Misc
- CLI output redesign https://github.com/cortexlabs/cortex/pull/335 https://github.com/cortexlabs/cortex/issues/297 (vishalbollu)
- Add upgrade docs https://github.com/cortexlabs/cortex/commit/97e92820af26d18be69e671599a61dd4d9cb69a4 (ospillinger)
Published by deliahu over 6 years ago
cortex -
Bug fixes:
- Allow unlimited request body size https://github.com/cortexlabs/cortex/commit/f846bd8356699770bb596bacdd5a41bc814ff917 (deliahu)
- Increase TF Serving gRPC timeout https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (1vn)
- Validate prepared sample https://github.com/cortexlabs/cortex/commit/e8df4f59565a92513290167d057551b86e4deff0 (1vn)
- Add TensorFlow data types https://github.com/cortexlabs/cortex/commit/73471d50546d46efb660815ee98bd759e907b232 (deliahu)
- Fix shape inference for TF serving https://github.com/cortexlabs/cortex/commit/ec3a798749d74ff5d94b39561a58042bb529d7ca (1vn)
Misc
- Improve model schema printing on errors https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (vishalbollu)
- Add API request logging https://github.com/cortexlabs/cortex/commit/483fb13c49eed6b4f35d3e35b0ff17dcf2ac3b40 (deliahu)
- Use tensorflow/serving Docker images https://github.com/cortexlabs/cortex/commit/2e7a4ac0d5e974af21abcf934fbe9f36f4a7ac10 (deliahu)
- Remove sample logging https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (1vn)
Published by deliahu over 6 years ago
cortex -
Bug fixes:
- Show TensorFlow Serving logs https://github.com/cortexlabs/cortex/pull/299 https://github.com/cortexlabs/cortex/issues/300 (vishalbollu)
- Update AWS CNI version to 1.5.2 https://github.com/cortexlabs/cortex/pull/291 https://github.com/cortexlabs/cortex/issues/247 https://github.com/cortexlabs/cortex/issues/219 (deliahu)
- Fix handler packages https://github.com/cortexlabs/cortex/pull/298 (vishalbollu)
- Update pod status logic https://github.com/cortexlabs/cortex/commit/d9ced094145ab95959f4366c92be8761832c2250 https://github.com/cortexlabs/cortex/commit/af9059fea9287a16bc89bc9dc545f7a114bcb7b2 https://github.com/cortexlabs/cortex/commit/ee4b3fc69b04e2cac2554b500e66ac61e54337c0 (deliahu)
- Add retries to metadata downloading https://github.com/cortexlabs/cortex/pull/286 https://github.com/cortexlabs/cortex/issues/45 (deliahu)
Misc
- Update to TensorFlow 1.14.0 https://github.com/cortexlabs/cortex/commit/ba0b541fdfa1492215beea18209cf632df2c9234 https://github.com/cortexlabs/cortex/issues/302 (deliahu)
Published by deliahu over 6 years ago
cortex -
New features:
- Add GPU support for serving ONNX models #232 #233 #220 (vishalbollu)
- Set model format based on path if not explicitly specified #251 #206 (vishalbollu)
- Improve get command output for APIs #263 #177 #257 #256 (vishalbollu)
- Aggregate API logs in cortex logs command #227 #214 (vishalbollu)
- Aggregate API logs in CloudWatch #259 #226 (vishalbollu)
- Add CLI command to list active Cortex deployments #268 #117 (vishalbollu)
Misc:
- Improve API ready timestamp #244 (deliahu)
- Prevent scaling immediately after API creation #255 #222 (deliahu)
- Add Cortex namespace to Python modules #230 #205 (vishalbollu)
- Allow ctrl+c to kill the manager process #252 #246 (vishalbollu)
- Improve logging around request handlers #240 #207 #204 (vishalbollu)
- Improve error message for prediction API mismatch #249 #176 (vishalbollu)
- Make all logs one line #216 (vishalbollu)
- Document how to add system packages to Docker containers #250 #245 (vishalbollu)
- Replace Argo with in-operator DAG manager #235 #218 (deliahu)
- Improve uninstall process #711017f (deliahu)
- Restructure iris example #270 #266 (vishalbollu)
Published by deliahu over 6 years ago
cortex -
New features:
- Add cluster autoscaler #194 #189 (ospillinger)
- Add pod autoscaler #196 #188 (deliahu)
- Automate/improve installation process, add manager image #193 #192 (ospillinger)
- Add support for serving ONNX models #182 #181 #164 (vishalbollu)
- Support Python pre- and post-processing for inference #182 #178 (vishalbollu)
Misc:
Published by deliahu over 6 years ago
cortex -
New features:
- Remove status command, fold into get and logs #171 #166 #165 (deliahu)
- Remove region for external data fa227d1 #174 (deliahu)
- Return expected input schema for prediction request errors febc293 (deliahu)
Bug fixes:
- Show previous logs for failed API pods f620125 #179 (deliahu)
- Fix external constants ec96d80 (deliahu)
Misc:
- Hide end-to-end components if only using serving ff4910a b0666fb #180 #167 (deliahu)
- Rename app to deployment #175 #180 (deliahu)
- Rename app.yaml to cortex.yaml 355fdfc #168 (deliahu)
- Improve get command resource printing 5e301c6 (deliahu)
- Improve Python error message if external data doesn't exist #183 (deliahu)
- Remove init CLI command a044d81 #172 (deliahu)
- Update to Go 1.12 #170 #169 (deliahu)
Published by deliahu over 6 years ago
cortex - v0.4.0
New features:
- Input redesign #72 #154 (deliahu)
- Add estimators #72 #154 (deliahu)
- Support deploying external TensorFlow models #124 #154 (1vn)
- Make raw columns optional #103 #111 (1vn)
- Make aggregators and transformers optional #90 #100 (1vn)
- Respond to prediction request with transformed columns #97 #153 (1vn)
- Support bucket regions for data ingestion #115 #155 (vishalbollu)
- Support not using an ingested column as a raw_column #69 #92 (vishalbollu)
- Update to TensorFlow 1.13 #95 #116 (1vn)
- Update to Spark 2.4.2 #87 (vishalbollu)
- Validate app name does not have underscore #59 #112 (1vn)
Bug fixes:
- Resolve Spark Context file added warnings #79 #137 (1vn)
- Improve built-in index_string data format #68 #127 (1vn)
- Address TF Serving gRPC Warning #61 #128 (1vn)
- Ingestion of Parquet files containing int or double columns throws validation errors #91 #92 (vishalbollu)
- Update Argo version #74 #125 (1vn)
- API is sometimes temporarily unavailable when updating #71 #85 (deliahu)
- Resources not allocated to Spark workloads to generate training datasets #56 #86 (vishalbollu)
Misc:
Published by deliahu over 6 years ago
cortex - v0.3.0
Merged pull requests:
- Rename transformed_column parameter in transform_spark() #49 (deliahu)
- OOM (Out of memory) status #40 (1vn)
- Change status to ingesting only after enough resources have been allocated #39 (vishalbollu)
- Change default TensorFlow log level to DEBUG #37 (1vn)
- Transformer model sentiment analysis example #36 (1vn)
- Add integration test to spark workloads #35 (vishalbollu)
- Tensor2Tensor Example and transform_tensorflow feature #29 (1vn)
Published by deliahu almost 7 years ago
cortex -
Merged pull requests:
- Allow specifying ranges in cortex requirements.txt #32 (vishalbollu)
- Prevent users from installing conflicting packages #30 (vishalbollu)
- Add additional config path error wrapping and index to embeds #15 (deliahu)
- Show config path in config errors #14 (1vn)
- Add ability to sample dataset #12 (vishalbollu)
- Expose additional csv parsing options #10 (vishalbollu)
- GPU support #6 (1vn)
- Bring your own package #5 (vishalbollu)
Published by deliahu almost 7 years ago