v0.42.1

New features

Add support for a new set of EC2 instance types, including the c6 and g5 families https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)

Bug fixes

Cosmetic fix: the VPC CNI logging functionality no longer triggers warning logs when running the cortex CLI https://github.com/cortexlabs/cortex/pull/2443 (RobertLucian)

Misc

Update Cortex dependency versions: eksctl, EKS (to 1.22), AWS IAM, Python, etc. https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian, deliahu)

- Go
Published by RobertLucian over 3 years ago

v0.42.0

New features

Add support for the Classic Load Balancer for APIs; the Network Load Balancer remains the default (docs) https://github.com/cortexlabs/cortex/pull/2413 https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)

Bug fixes

Fix Async API http/tcp probes when probing the empty root path (/) https://github.com/cortexlabs/cortex/pull/2407 (RobertLucian)
Fix nil pointer exception in the cortex cluster export command https://github.com/cortexlabs/cortex/pull/2415 https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
Ensure that user-specified environment variables are ordered deterministically in the Kubernetes deployment spec https://github.com/cortexlabs/cortex/pull/2411 (deliahu)
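
The deterministic-ordering fix above can be illustrated with a minimal sketch. Cortex itself is written in Go, and the notes only say the ordering is deterministic; sorting by variable name and the function name `ordered_env_vars` are assumptions made here for illustration.

```python
def ordered_env_vars(env):
    """Return Kubernetes-style env entries sorted by variable name.

    Sorting by name is an assumption: any stable order would keep the
    generated deployment spec identical across runs.
    """
    return [{"name": name, "value": env[name]} for name in sorted(env)]


# Two dicts with the same content but different insertion order
# produce the same list, so the spec does not change spuriously.
first = ordered_env_vars({"B": "2", "A": "1"})
second = ordered_env_vars({"A": "1", "B": "2"})
print(first == second)
```

A stable order matters because a spec that differs only in entry order can still be treated as a changed deployment.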

Misc

Ensure that the batch on-job-complete request contains a valid JSON body https://github.com/cortexlabs/cortex/pull/2409 (RobertLucian)

Published by deliahu over 4 years ago

v0.41.0

New features

Support configurable pre_stop command for containers https://github.com/cortexlabs/cortex/pull/2403 (docs) (deliahu)

Misc

Support m6i instance types https://github.com/cortexlabs/cortex/pull/2398 (deliahu)
Update to Kubernetes v1.21 https://github.com/cortexlabs/cortex/pull/2398 (deliahu)

Bug fixes

Wait for in-flight requests to reach zero before terminating the proxy container https://github.com/cortexlabs/cortex/pull/2402 (deliahu)
Fix cortex get --env command https://github.com/cortexlabs/cortex/pull/2404 (deliahu)
Fix cluster price estimate during cortex cluster up for spot node groups with on-demand base capacity https://github.com/cortexlabs/cortex/pull/2406 (RobertLucian)
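
Waiting for in-flight requests to reach zero before terminating, as in the proxy fix above, can be sketched as follows. This is not the Cortex proxy's implementation (which is in Go); the class and method names are hypothetical.

```python
import threading


class InFlightTracker:
    """Counts in-flight requests and lets a shutdown hook wait for zero."""

    def __init__(self):
        self._count = 0
        self._cond = threading.Condition()

    def start(self):
        # Call when a request begins.
        with self._cond:
            self._count += 1

    def finish(self):
        # Call when a request ends; wake waiters once nothing is in flight.
        with self._cond:
            self._count -= 1
            if self._count == 0:
                self._cond.notify_all()

    def wait_until_idle(self, timeout=None):
        # Returns True if the count reached zero, False on timeout.
        with self._cond:
            return self._cond.wait_for(lambda: self._count == 0, timeout=timeout)


tracker = InFlightTracker()
tracker.start()
tracker.finish()
print(tracker.wait_until_idle(timeout=1))
```

A shutdown handler would call `wait_until_idle` with a timeout so that a stuck request cannot block termination indefinitely.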

Nucleus Model Server

We have released v0.1.0 of the Nucleus model server!

Nucleus is a model server for TensorFlow and generic Python models. It is compatible with Cortex clusters, Kubernetes clusters, and any other container-based deployment platform. Nucleus can also be run locally via Docker Compose.

Some of Nucleus's features include:

Generic Python models (PyTorch, ONNX, Sklearn, MLFlow, Numpy, Pandas, etc)
TensorFlow models
CPU and GPU support
Serve models directly from S3 paths
Configurable multiprocessing and multithreading
Multi-model endpoints
Dynamic server-side request batching
Automatic model reloading when new model versions are uploaded to S3
Model caching based on LRU policy (on disk and memory)
HTTP and gRPC support
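
As an illustration of the LRU caching policy mentioned in the list above, here is a minimal in-memory sketch. It is not Nucleus's implementation; the class name `LRUModelCache` and its methods are hypothetical, and a real model cache would also manage disk space and model loading.

```python
from collections import OrderedDict


class LRUModelCache:
    """Keeps at most `capacity` models, evicting the least recently used."""

    def __init__(self, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._items = OrderedDict()

    def get(self, key):
        # A hit marks the model as most recently used.
        if key not in self._items:
            return None
        self._items.move_to_end(key)
        return self._items[key]

    def put(self, key, model):
        # Insert or refresh the model, then evict from the oldest end.
        self._items[key] = model
        self._items.move_to_end(key)
        while len(self._items) > self.capacity:
            self._items.popitem(last=False)

    def keys(self):
        # Least recently used first.
        return list(self._items)


cache = LRUModelCache(capacity=2)
cache.put("model-a", object())
cache.put("model-b", object())
cache.get("model-a")            # model-a is now the most recently used
cache.put("model-c", object())  # evicts model-b
print(cache.keys())
```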

Published by deliahu over 4 years ago

v0.40.0

New features

Support concurrency for Async APIs (via the max_concurrency field) https://github.com/cortexlabs/cortex/pull/2376 https://github.com/cortexlabs/cortex/issues/2200 (miguelvr)
Add graphs for cluster-wide and per-API cost breakdowns to the cluster metrics dashboard https://github.com/cortexlabs/cortex/pull/2382 https://github.com/cortexlabs/cortex/issues/1962 (RobertLucian)
Allow worker nodes containing Async APIs to scale to zero (now a shared async gateway is used, which runs on the operator node group) https://github.com/cortexlabs/cortex/pull/2380 https://github.com/cortexlabs/cortex/issues/2279 (vishalbollu)
Add cortex describe API_NAME command for Realtime and Async APIs https://github.com/cortexlabs/cortex/pull/2368 https://github.com/cortexlabs/cortex/issues/2320 https://github.com/cortexlabs/cortex/issues/2359 (RobertLucian)
Support updating the priority of an existing node group https://github.com/cortexlabs/cortex/pull/2369 https://github.com/cortexlabs/cortex/issues/2254 (vishalbollu)

Misc

Improve the reporting of API statuses https://github.com/cortexlabs/cortex/pull/2368 https://github.com/cortexlabs/cortex/issues/2320 https://github.com/cortexlabs/cortex/issues/2359 (RobertLucian)
Remove the default readiness probe on the target port if a custom readiness probe is specified in the API spec https://github.com/cortexlabs/cortex/pull/2379 (RobertLucian)

Published by deliahu almost 5 years ago

v0.39.1

Bug fixes

Remove an unnecessary cluster validation which limited the IP ranges that could be used in api_load_balancer_cidr_white_list and operator_load_balancer_cidr_white_list https://github.com/cortexlabs/cortex/pull/2363 (RobertLucian)

Published by deliahu almost 5 years ago

v0.39.0

New features

Add cortex cluster health command to show the health of the cluster's components https://github.com/cortexlabs/cortex/pull/2313 https://github.com/cortexlabs/cortex/issues/2029 (miguelvr)
Forward request headers to AsyncAPIs https://github.com/cortexlabs/cortex/pull/2329 https://github.com/cortexlabs/cortex/issues/2296 (miguelvr)
Add metrics dashboard for Task APIs https://github.com/cortexlabs/cortex/pull/2311 https://github.com/cortexlabs/cortex/pull/2322 (RobertLucian)

Reliability

Enable larger cluster sizes (up to 1000 nodes with 10000 pods) by enabling IPVS https://github.com/cortexlabs/cortex/pull/2357 https://github.com/cortexlabs/cortex/issues/1834 (RobertLucian)
Automatically limit the rate at which nodes are added to avoid overloading the Kubernetes API server https://github.com/cortexlabs/cortex/pull/2331 https://github.com/cortexlabs/cortex/pull/2338 https://github.com/cortexlabs/cortex/issues/2314 (RobertLucian)
Ensure cluster autoscaler availability https://github.com/cortexlabs/cortex/pull/2347 https://github.com/cortexlabs/cortex/issues/2346 (RobertLucian)
Improve istiod availability at large scale https://github.com/cortexlabs/cortex/pull/2342 https://github.com/cortexlabs/cortex/issues/2332 (RobertLucian)
Reduce metrics shown in cortex get to improve scalability and reliability of the command https://github.com/cortexlabs/cortex/pull/2333 https://github.com/cortexlabs/cortex/issues/2319 (vishalbollu)
Show aggregated node statistics in the cluster dashboard https://github.com/cortexlabs/cortex/pull/2336 https://github.com/cortexlabs/cortex/issues/2318 (RobertLucian)

Bug fixes

Ensure that the Content-Type header is properly set to application/json for responses to Async API submissions https://github.com/cortexlabs/cortex/pull/2323 (vishalbollu)
Fix pod autoscaler scale-to-zero edge cases https://github.com/cortexlabs/cortex/pull/2350 (miguelvr)
Allow autoscaling configuration to be updated on a running API https://github.com/cortexlabs/cortex/pull/2355 (RobertLucian)
Fix node group priority calculation for the cluster autoscaler https://github.com/cortexlabs/cortex/pull/2358 https://github.com/cortexlabs/cortex/pull/2343 (RobertLucian, deliahu)
Allow the node_groups selector to be updated in a running API https://github.com/cortexlabs/cortex/pull/2354 (RobertLucian)
Fix the active replicas graph on the Async API dashboard https://github.com/cortexlabs/cortex/pull/2328 (RobertLucian)

Docs

Add a guide for running in production https://github.com/cortexlabs/cortex/pull/2334 https://github.com/cortexlabs/cortex/issues/2317 (vishalbollu)
Add a guide for configuring an HTTP API Gateway https://github.com/cortexlabs/cortex/pull/2341 (deliahu)

Misc

Add a graph of the number of active and queued requests to the Async API dashboard https://github.com/cortexlabs/cortex/pull/2326 https://github.com/cortexlabs/cortex/issues/1960 (deliahu)
Add a graph of the number of instances to the cluster dashboard https://github.com/cortexlabs/cortex/pull/2336 https://github.com/cortexlabs/cortex/issues/2318 (RobertLucian)
Ensure that cortex cluster info --print-config displays YAML that is consumable by cortex cluster configure https://github.com/cortexlabs/cortex/pull/2324 (vishalbollu)

Published by deliahu almost 5 years ago

v0.38.0

New features

Support autoscaling down to zero replicas for Realtime APIs https://github.com/cortexlabs/cortex/pull/2298 https://github.com/cortexlabs/cortex/issues/445 (miguelvr)
Allow ssl_certificate_arn, api_load_balancer_cidr_white_list, and operator_load_balancer_cidr_white_list to be updated on an existing cluster (via the cortex cluster configure command) https://github.com/cortexlabs/cortex/pull/2305 https://github.com/cortexlabs/cortex/issues/2107 (vishalbollu)
Allow Prometheus's instance type to be configured (docs) https://github.com/cortexlabs/cortex/pull/2307 https://github.com/cortexlabs/cortex/issues/2285 (RobertLucian)
Allow multiple Inferentia chips to be assigned to a single container https://github.com/cortexlabs/cortex/pull/2304 https://github.com/cortexlabs/cortex/issues/1123 (deliahu)

Bug fixes

Fix cluster autoscaler's nodegroup priority calculation https://github.com/cortexlabs/cortex/pull/2309 (RobertLucian)

Misc

Various scalability improvements https://github.com/cortexlabs/cortex/pull/2307 https://github.com/cortexlabs/cortex/pull/2304 https://github.com/cortexlabs/cortex/issues/2297 https://github.com/cortexlabs/cortex/issues/2278 https://github.com/cortexlabs/cortex/issues/2285
Allow setting a nodegroup's max_instances to 0 https://github.com/cortexlabs/cortex/pull/2310 (RobertLucian)

Published by deliahu almost 5 years ago

v0.37.0

New features

Support ARM instance types https://github.com/cortexlabs/cortex/pull/2268 https://github.com/cortexlabs/cortex/issues/1528 (RobertLucian)
Add cortex cluster configure command to add, remove, or scale nodegroups on a running cluster https://github.com/cortexlabs/cortex/pull/2246 https://github.com/cortexlabs/cortex/issues/2096 (RobertLucian)
Add cortex cluster info --print-config command to print the current configuration of a running cluster https://github.com/cortexlabs/cortex/pull/2246 (RobertLucian)
Add metrics dashboard for Async APIs https://github.com/cortexlabs/cortex/pull/2242 https://github.com/cortexlabs/cortex/issues/1958 (miguelvr)
Support cortex refresh command for Async APIs https://github.com/cortexlabs/cortex/pull/2265 https://github.com/cortexlabs/cortex/issues/2237 (deliahu)

Breaking changes

The cortex cluster scale command has been replaced by the cortex cluster configure command.

Bug fixes

Fix Async API metrics reporting for non-200 response status codes https://github.com/cortexlabs/cortex/pull/2266 (miguelvr)
Make batch job metrics persistence resilient to instance termination https://github.com/cortexlabs/cortex/pull/2247 https://github.com/cortexlabs/cortex/issues/2041 (vishalbollu)
Make network validations during cortex cluster up more permissive (to avoid unnecessarily failing checks on GovCloud) https://github.com/cortexlabs/cortex/pull/2248 (vishalbollu)
Fix Inferentia resource requests https://github.com/cortexlabs/cortex/pull/2250 (RobertLucian)

Docs

Add instructions for exporting logs and metrics to external tools (vishalbollu)

Misc

Improve output of cortex cluster info for running batch jobs https://github.com/cortexlabs/cortex/pull/2270 (deliahu)
Persist Batch job metrics regardless of job status https://github.com/cortexlabs/cortex/pull/2244 (miguelvr)
Support creating clusters with no node groups https://github.com/cortexlabs/cortex/pull/2269 (deliahu)
Improve handling of container startup errors in batch jobs with multiple containers https://github.com/cortexlabs/cortex/pull/2260 https://github.com/cortexlabs/cortex/issues/2217 (vishalbollu)
Add CPU and memory resource requests to the proxy and dequeuer containers https://github.com/cortexlabs/cortex/pull/2252 (deliahu)

Published by deliahu almost 5 years ago

v0.36.0

New features

Support running arbitrary Docker containers in all workload types (Realtime, Async, Batch, Task) https://github.com/cortexlabs/cortex/pull/2173 (RobertLucian, miguelvr, vishalbollu, deliahu, ospillinger)
Support autoscaling Async APIs to zero replicas https://github.com/cortexlabs/cortex/pull/2224 https://github.com/cortexlabs/cortex/issues/2199 (RobertLucian)

Breaking changes

With this release, we have generalized Cortex to exclusively support running arbitrary Docker containers for all workload types (Realtime, Async, Batch, and Task). This enables the use of any model server, programming language, etc. As a result, the API configuration has been updated: the predictor section has been removed, the pod section has been added, and the autoscaling parameters have been modified slightly (depending on the workload type). See updated docs for Realtime, Async, Batch, and Task. If you'd like to see examples of Dockerizing Python applications, see our test/apis folder.
The cortex prepare-debug command has been removed; Cortex now exclusively runs Docker containers, which can be run locally via docker run.
The cortex patch command has been removed; its behavior is now identical to cortex deploy.
The cortex logs command now prints a CloudWatch Insights URL with a pre-populated query which can be executed to show logs from your workloads, since this is the recommended approach in production. If you wish to stream logs from a pod at random, you can use cortex logs --random-pod (keep in mind that these logs will not include some system logs related to your workload).
gRPC support has been temporarily removed; we are working on adding it back in v0.37.

Bug fixes

Handle exception when initializing the Python client when the default environment is not set https://github.com/cortexlabs/cortex/pull/2225 https://github.com/cortexlabs/cortex/issues/2223 (deliahu)

Docs

Document how to configure SMTP in Grafana (e.g. to enable email alerts) https://github.com/cortexlabs/cortex/pull/2219 (RobertLucian)

Misc

Show CloudWatch Insights URL with a pre-populated query in the output of cortex logs https://github.com/cortexlabs/cortex/issues/2085 (vishalbollu)
Improve efficiency of batch job submission validations https://github.com/cortexlabs/cortex/pull/2179 https://github.com/cortexlabs/cortex/issues/2178 (deliahu)

Published by deliahu almost 5 years ago

v0.35.0

New features

Avoid processing HTTP requests that have been cancelled by the client https://github.com/cortexlabs/cortex/pull/2135 https://github.com/cortexlabs/cortex/issues/1453 (vishalbollu)
Support GP3 volumes (and make GP3 the default volume type) https://github.com/cortexlabs/cortex/pull/2130 https://github.com/cortexlabs/cortex/issues/1843 (RobertLucian)
Allow setting the shared memory (shm) size for Task APIs https://github.com/cortexlabs/cortex/pull/2132 https://github.com/cortexlabs/cortex/issues/2115 (RobertLucian)
Implement automatic 7-day expiration for Async API responses https://github.com/cortexlabs/cortex/pull/2151 (RobertLucian)
Add cortex env rename command https://github.com/cortexlabs/cortex/pull/2165 https://github.com/cortexlabs/cortex/issues/1773 (deliahu)

Breaking changes

The Python client methods which deploy Python classes have been separated from the deploy() method. Now, deploy() is used only to deploy project folders, and deploy_realtime_api(), deploy_async_api(), deploy_batch_api(), and deploy_task_api() are for deploying Python classes. (docs)
The name of the bucket that Cortex uses for internal purposes is no longer configurable. During cluster creation, Cortex will auto-generate the bucket name (and create the bucket if it doesn't exist). During cluster deletion, the bucket will be emptied (unless the --keep-aws-resources flag is provided to cortex cluster down). Users' files should not be stored in the Cortex internal bucket.

Bug fixes

Fix the number of Async API replicas shown in cortex cluster info https://github.com/cortexlabs/cortex/pull/2140 https://github.com/cortexlabs/cortex/issues/2129 (RobertLucian)

Misc

Delete all cortex-created AWS resources when deleting a cluster, and support the --keep-aws-resources flag with cortex cluster down to preserve AWS resources https://github.com/cortexlabs/cortex/pull/2161 https://github.com/cortexlabs/cortex/issues/1612 (RobertLucian)
Validate the user's AWS service quota for number of security groups and in/out rules during cluster creation https://github.com/cortexlabs/cortex/pull/2127 https://github.com/cortexlabs/cortex/issues/2087 (RobertLucian)
Allow specifying only one of --min-instances or --max-instances with cortex cluster scale https://github.com/cortexlabs/cortex/pull/2149 (RobertLucian)
Use 405 status code for un-implemented Realtime API methods https://github.com/cortexlabs/cortex/pull/2158 (RobertLucian)
Decrease file size and project size limits https://github.com/cortexlabs/cortex/pull/2152 (deliahu)
Set the default environment name to the cluster name when creating a cluster https://github.com/cortexlabs/cortex/pull/2164 https://github.com/cortexlabs/cortex/issues/1546 (deliahu)

Published by deliahu about 5 years ago

v0.34.0

New features

Support handling GET, PUT, PATCH, and DELETE HTTP requests in Realtime APIs (docs) https://github.com/cortexlabs/cortex/pull/2111 https://github.com/cortexlabs/cortex/issues/2063 (RobertLucian)
Support running realtime API containers locally for debugging / development purposes (docs) https://github.com/cortexlabs/cortex/pull/2112 https://github.com/cortexlabs/cortex/issues/2077 (vishalbollu)
Support multiple gRPC services / methods (which can be named arbitrarily) in a single Realtime API (docs) https://github.com/cortexlabs/cortex/pull/2111 https://github.com/cortexlabs/cortex/issues/2063 (RobertLucian)
Support specifying a list of node groups on which a workload is allowed to run (see configuration docs for Realtime, Async, Batch, or Task APIs) https://github.com/cortexlabs/cortex/pull/2098 https://github.com/cortexlabs/cortex/issues/2034 (RobertLucian)
Support AWS GovCloud regions https://github.com/cortexlabs/cortex/pull/2118 https://github.com/cortexlabs/cortex/issues/2103 (vishalbollu)

Breaking changes

"predictor" has been renamed to "handler" throughout the product (API configuration and Python APIs). In addition, as a result of supporting additional HTTP method verbs, predict() has been renamed to handle_post() in Realtime APIs (handle_get(), handle_put(), handle_patch(), and handle_delete() are now also supported). For consistency, predict() has been renamed to handle_async() for Async APIs, and handle_batch() for Batch APIs. See the examples for Realtime, Async, and Batch APIs. Task APIs have not been changed.
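
A rough sketch of a handler class after the rename is shown below. Only the method names handle_post() and handle_get() come from the notes above; the class name, the constructor taking a `config` dict, and the argument lists are assumptions made for illustration, so check the Realtime API docs for the exact signatures.

```python
class Handler:
    """Hypothetical Realtime API handler using the renamed methods."""

    def __init__(self, config):
        # `config` is assumed to be a dict of user-defined settings.
        self.greeting = config.get("greeting", "hello")

    def handle_post(self, payload):
        # Previously predict(); receives the parsed request body.
        return {"message": f"{self.greeting}, {payload['name']}"}

    def handle_get(self):
        # One of the additional HTTP method handlers that are now supported.
        return {"status": "ok"}


handler = Handler({"greeting": "hi"})
print(handler.handle_post({"name": "Ada"}))
print(handler.handle_get())
```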

Bug fixes

Fix invalid Async workload status during processing https://github.com/cortexlabs/cortex/pull/2106 https://github.com/cortexlabs/cortex/issues/2104 (RobertLucian)

Docs

Add docs for configuring Grafana alerts (RobertLucian)
Document how to create a Cortex cluster without administrator IAM access (vishalbollu)
Add docs for mirroring Cortex's docker images to a private repo (vishalbollu)

Misc

Support json output for the cortex cluster info command https://github.com/cortexlabs/cortex/pull/2089 https://github.com/cortexlabs/cortex/issues/2062 (RobertLucian)
Allow nodegroups to be scaled down to max_instances == 0 https://github.com/cortexlabs/cortex/pull/2095 (deliahu)

Published by deliahu about 5 years ago

v0.33.0

New features

Allow specifying a CIDR range whitelist for APIs and the operator (docs) https://github.com/cortexlabs/cortex/pull/2071 https://github.com/cortexlabs/cortex/issues/2003 (vishalbollu)
Enable CORS for async, batch, and task APIs https://github.com/cortexlabs/cortex/pull/2082 https://github.com/cortexlabs/cortex/issues/2073 (deliahu)
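
To make the CIDR whitelist semantics concrete, the sketch below checks whether a client address falls inside any whitelisted range using Python's standard ipaddress module. In a cluster, the load balancer enforces the whitelist; the function name `is_allowed` and the sample ranges are hypothetical.

```python
import ipaddress


def is_allowed(client_ip, cidr_whitelist):
    """Return True if client_ip falls inside any CIDR range in the whitelist."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in cidr_whitelist)


# Example ranges only; a real whitelist comes from the cluster configuration.
whitelist = ["10.0.0.0/8", "203.0.113.0/24"]
print(is_allowed("10.1.2.3", whitelist))
print(is_allowed("192.168.1.1", whitelist))
```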

Breaking changes

The onnx predictor type has been replaced by the python predictor type; please use the python predictor type instead (all onnx models are fully supported by the python predictor type)

Bug fixes

Fix bug affecting async api consistency during heavy traffic https://github.com/cortexlabs/cortex/pull/2072 (RobertLucian)
Fix bug affecting async api updates https://github.com/cortexlabs/cortex/pull/2067 (vishalbollu)

Misc

Rename cortex cluster configure command to cortex cluster scale https://github.com/cortexlabs/cortex/pull/2040 https://github.com/cortexlabs/cortex/issues/1972 (RobertLucian)
Disable AZRebalance autoscaling group process https://github.com/cortexlabs/cortex/pull/2042 https://github.com/cortexlabs/cortex/issues/1349 (RobertLucian)
Add horizontal pod autoscaler to async API gateway https://github.com/cortexlabs/cortex/pull/2079 https://github.com/cortexlabs/cortex/issues/2078 (RobertLucian)
Rename async modules to async_api to avoid name collision with the reserved keyword in Python 3.7+ https://github.com/cortexlabs/cortex/pull/2066 https://github.com/cortexlabs/cortex/issues/2052 (vishalbollu)
Back up images to Docker Hub https://github.com/cortexlabs/cortex/pull/2081 (vishalbollu)
Add additional debugging info for cluster up failures https://github.com/cortexlabs/cortex/pull/2080 https://github.com/cortexlabs/cortex/issues/2027 (vishalbollu)

Published by deliahu about 5 years ago

v0.32.0

New features

Add gRPC support to realtime APIs (docs) https://github.com/cortexlabs/cortex/pull/1997 https://github.com/cortexlabs/cortex/issues/1056 (RobertLucian)
Add support for ONNX and TensorFlow predictor types in async APIs (docs) https://github.com/cortexlabs/cortex/pull/1996 https://github.com/cortexlabs/cortex/issues/1980 (miguelvr)
Support using ECR images from other AWS accounts and regions https://github.com/cortexlabs/cortex/pull/2011 https://github.com/cortexlabs/cortex/issues/1988 (vishalbollu)

Breaking changes

GCP support has been removed so that we can focus our efforts on improving the scalability, reliability, and security of Cortex on AWS. Cortex on GCP will still be available in v0.31. If you are currently using Cortex on GCP, our team will be happy to help you migrate to AWS or work with you to find alternative solutions. Please feel free to reach out to us on Slack or email us at hello@cortex.dev if you're interested.

Bug fixes

Fix memory plots on Grafana dashboards for realtime and batch APIs https://github.com/cortexlabs/cortex/pull/2024 https://github.com/cortexlabs/cortex/pull/2014 https://github.com/cortexlabs/cortex/issues/1970 (RobertLucian)

Docs

Misc docs improvements https://github.com/cortexlabs/cortex/pull/1994 (ospillinger)

Misc

Increase kubelet's registryPullQPS limit from 5 to 10 https://github.com/cortexlabs/cortex/pull/2023 https://github.com/cortexlabs/cortex/issues/1989 (miguelvr)
Pin the AMI version https://github.com/cortexlabs/cortex/pull/2010 https://github.com/cortexlabs/cortex/issues/1975 https://github.com/cortexlabs/cortex/issues/1615 (vishalbollu)

Published by deliahu about 5 years ago

v0.31.1

Bug fixes

Fix preemptible node pools on GCP not autoscaling https://github.com/cortexlabs/cortex/pull/1981 (vishalbollu)
Fix replica autoscaler targeting incorrect deployments on operator restart https://github.com/cortexlabs/cortex/pull/1982 (miguelvr)
Fix replica autoscaler not being reinitialized for running APIs on operator restart on GCP https://github.com/cortexlabs/cortex/pull/1984 (vishalbollu)

Published by vishalbollu about 5 years ago

v0.31.0

New features

Add support for AsyncAPI (experimental) (docs) https://github.com/cortexlabs/cortex/pull/1935 https://github.com/cortexlabs/cortex/issues/1610 (miguelvr)
Add support for multi-instance-type clusters to AWS/GCP providers (experimental) (aws/gcp docs) https://github.com/cortexlabs/cortex/pull/1951 (RobertLucian)
Allow users to duplicate/mirror traffic using shadow pipelines https://github.com/cortexlabs/cortex/pull/1948 https://github.com/cortexlabs/cortex/issues/1889 (docs) (vishalbollu)

Breaking changes

on_demand_backup in cluster configuration has been removed in favour of using a cluster with a mixture of spot and on-demand nodegroups. See multi-instance documentation for aws and gcp for more details.

Bug fixes

Fix Python client not respecting CORTEX_CLI_CONFIG_DIR environment variable for client-id.txt https://github.com/cortexlabs/cortex/pull/1953 (jackmpcollins)
Prevent threads from being stuck in DynamicBatcher https://github.com/cortexlabs/cortex/pull/1915 (cbensimon)
Fix unexpected cortex logs termination by increasing buffer size https://github.com/cortexlabs/cortex/pull/1939 (vishalbollu)
Decouple cluster deletion from EBS volume deletion for cortex cluster down https://github.com/cortexlabs/cortex/pull/1954 (deliahu)
Fix spot/on-demand GPU instances not joining the cluster by upgrading to eksctl 0.40.0 https://github.com/cortexlabs/cortex/pull/1955 (vishalbollu)
Prevent premature "queue not found" errors by preserving the SQS queue for a few minutes after the job has completed https://github.com/cortexlabs/cortex/pull/1952 (vishalbollu)

Docs

Update docs https://github.com/cortexlabs/cortex/pull/1949 (ospillinger)

Misc

Configure a default cortex client to manage APIs from within cortex workloads https://github.com/cortexlabs/cortex/pull/1942 https://github.com/cortexlabs/cortex/issues/1644 (RobertLucian)
Save batch metrics to cloud to preserve job metrics history https://github.com/cortexlabs/cortex/pull/1940 (vishalbollu)

Published by vishalbollu about 5 years ago

v0.30.0

New features

Record custom metrics from predictors and view them in Grafana (docs) https://github.com/cortexlabs/cortex/pull/1910 https://github.com/cortexlabs/cortex/issues/1897 (miguelvr)
Add granular pod metrics to the Grafana dashboards https://github.com/cortexlabs/cortex/pull/1905 (RobertLucian)
Add node metrics to Grafana dashboards https://github.com/cortexlabs/cortex/pull/1900 (miguelvr)

Breaking changes

Remove support for installing Cortex on your own Kubernetes Cluster https://github.com/cortexlabs/cortex/pull/1921 (RobertLucian)

Bug fixes

Fix bug where successfully completed jobs were marked as completed with errors https://github.com/cortexlabs/cortex/pull/1913 (vishalbollu)
Fix bug where batch jobs were being terminated unnecessarily https://github.com/cortexlabs/cortex/pull/1917 (vishalbollu)
Prevent cluster autoscaler from reallocating job pods https://github.com/cortexlabs/cortex/pull/1919 (vishalbollu)
Address AWS cluster up quota issues such as not enough NAT Gateways or EIPs https://github.com/cortexlabs/cortex/pull/1912 (RobertLucian)
Delete unused prometheus volume on cluster down https://github.com/cortexlabs/cortex/pull/1863 (miguelvr)
Create .cortex dir if not present https://github.com/cortexlabs/cortex/pull/1909 (RobertLucian)

Docs

Add docs for accessing dashboard through private load balancer (docs) https://github.com/cortexlabs/cortex/pull/1907 (deliahu)

Misc

Allow specifying paths for requirements.txt, conda-packages.txt & dependencies.sh (docs) https://github.com/cortexlabs/cortex/pull/1896 https://github.com/cortexlabs/cortex/pull/1927 https://github.com/cortexlabs/cortex/issues/1777 (miguelvr)
Log relevant kubernetes events to API specific log streams https://github.com/cortexlabs/cortex/pull/1906 https://github.com/cortexlabs/cortex/issues/833 (miguelvr)
Support credentials using AWS_SESSION_TOKEN with the CLI/Client (docs) https://github.com/cortexlabs/cortex/pull/1908 https://github.com/cortexlabs/cortex/pull/1920 https://github.com/cortexlabs/cortex/issues/1134 https://github.com/cortexlabs/cortex/issues/1865 (vishalbollu)
Provide auth to Operator and APIs by attaching IAM policies to the cluster (docs) https://github.com/cortexlabs/cortex/pull/1908 https://github.com/cortexlabs/cortex/issues/1858 (vishalbollu)

Published by vishalbollu over 5 years ago

v0.29.0

New features

Add Grafana dashboard for APIs (docs) https://github.com/cortexlabs/cortex/pull/1867 https://github.com/cortexlabs/cortex/pull/1885 https://github.com/cortexlabs/cortex/pull/1890 https://github.com/cortexlabs/cortex/pull/1887 (miguelvr)
Support API autoscaling in GCP clusters (docs) https://github.com/cortexlabs/cortex/pull/1814 https://github.com/cortexlabs/cortex/pull/1879 https://github.com/cortexlabs/cortex/issues/1601 (miguelvr)
Support traffic splitting in GCP clusters (docs) https://github.com/cortexlabs/cortex/pull/1892 https://github.com/cortexlabs/cortex/issues/1660 (miguelvr)

Breaking changes

The default Docker images for APIs have been slimmed down to not include packages other than what Cortex requires to function. Therefore, when deploying APIs, it is now necessary to include the dependencies that your predictor needs in requirements.txt (docs) and/or dependencies.sh (docs).

Bug fixes

Disable dynamic batcher for TensorFlow predictor type https://github.com/cortexlabs/cortex/pull/1888 (miguelvr)
Support empty directory objects for models saved in S3/GCS https://github.com/cortexlabs/cortex/pull/1830 https://github.com/cortexlabs/cortex/issues/1829 (RobertLucian)
Fix bug which prevented Task APIs on GCP from being cleaned up after completion https://github.com/cortexlabs/cortex/pull/1871 (RobertLucian)

Docs

Add documentation for using a version of Python other than the default via dependencies.sh (docs) or custom images (docs) https://github.com/cortexlabs/cortex/pull/1862 https://github.com/cortexlabs/cortex/issues/1779 (RobertLucian)

Misc

Support deploying predictor Python classes from more environments (e.g. from separate Python files, AWS Lambda) https://github.com/cortexlabs/cortex/pull/1883 https://github.com/cortexlabs/cortex/commit/3a1b777d06e660a49b6223badda4c5e8b1fe4ec1 https://github.com/cortexlabs/cortex/issues/1824 https://github.com/cortexlabs/cortex/issues/1826 (vishalbollu)
Improve error logging for Batch and Task APIs https://github.com/cortexlabs/cortex/pull/1866 https://github.com/cortexlabs/cortex/issues/1833 (RobertLucian)

Published by deliahu over 5 years ago

v0.28.0

New features

Support installing Cortex on an existing Kubernetes cluster (on AWS or GCP) (docs) https://github.com/cortexlabs/cortex/pull/1837 https://github.com/cortexlabs/cortex/issues/1808 (vishalbollu)

Breaking changes

The cloudwatch dashboard has been removed as a result of our switch to Prometheus for metrics aggregation. The dashboard will be replaced with an alternative in an upcoming release.

Bug fixes

Fix bug which can cause requests to APIs from a Python client to timeout during cluster autoscaling https://github.com/cortexlabs/cortex/pull/1841 https://github.com/cortexlabs/cortex/issues/1840 (RobertLucian)
Fix bug which can cause downscale_stabilization_period to be disregarded during downscaling https://github.com/cortexlabs/cortex/pull/1847 https://github.com/cortexlabs/cortex/issues/1846 (RobertLucian)

Misc

AWS credentials are no longer required to connect the CLI to the cluster operator. If you need to restrict access to your cluster operator, configure the operator's load balancer to be private by setting operator_load_balancer_scheme: internal in your cluster configuration file, and set up VPC Peering. We plan to support a new auth strategy in an upcoming release.
Improve S6 error code/signal handling https://github.com/cortexlabs/cortex/pull/1825 https://github.com/cortexlabs/cortex/issues/1703 (RobertLucian)

Published by deliahu over 5 years ago

v0.27.0

New features

Add new API type TaskAPI for running arbitrary Python jobs (docs) https://github.com/cortexlabs/cortex/pull/1717 https://github.com/cortexlabs/cortex/issues/253 (miguelvr, RobertLucian)
Write Cortex's logs as structured logs, and allow use of Cortex's structured logger in predictors (supports adding extra fields) (aws docs, gcp docs) https://github.com/cortexlabs/cortex/pull/1778 https://github.com/cortexlabs/cortex/pull/1803 https://github.com/cortexlabs/cortex/pull/1804 https://github.com/cortexlabs/cortex/issues/1732 https://github.com/cortexlabs/cortex/issues/1563 (vishalbollu)
Support preemptible instances on GCP (docs) https://github.com/cortexlabs/cortex/pull/1791 https://github.com/cortexlabs/cortex/issues/1631 (RobertLucian)
Support private load balancers on GCP (docs) https://github.com/cortexlabs/cortex/pull/1786 https://github.com/cortexlabs/cortex/issues/1621 (deliahu)
Support GCP instances with multiple GPUs (docs) https://github.com/cortexlabs/cortex/pull/1789 https://github.com/cortexlabs/cortex/issues/1784 (deliahu)

Breaking changes

cortex logs now streams logs from a single replica, chosen at random, when an API has multiple replicas. The recommended way to analyze production logs is via a dedicated logging tool (by default, logs are sent to CloudWatch on AWS and Stackdriver on GCP).

Bug fixes

Misc Python client fixes https://github.com/cortexlabs/cortex/pull/1798 https://github.com/cortexlabs/cortex/pull/1782 https://github.com/cortexlabs/cortex/pull/1772 (vishalbollu, RobertLucian)

Docs

Document the shared /mnt directory for TensorFlow predictors https://github.com/cortexlabs/cortex/pull/1802 https://github.com/cortexlabs/cortex/issues/1792 (deliahu)
Misc GCP docs improvements https://github.com/cortexlabs/cortex/pull/1799 (deliahu)

Misc

Improve out-of-memory status reporting (RobertLucian)
Improve batch job cleanup process https://github.com/cortexlabs/cortex/pull/1797 https://github.com/cortexlabs/cortex/pull/1796 (vishalbollu)
Remove grpc msg send/receive limit https://github.com/cortexlabs/cortex/pull/1769 https://github.com/cortexlabs/cortex/issues/1740 (RobertLucian)

- Go
Published by deliahu over 5 years ago

cortex - v0.26.0

v0.26.0

New features

Support configuring the log level for APIs (docs) https://github.com/cortexlabs/cortex/pull/1741 https://github.com/cortexlabs/cortex/issues/1484 (RobertLucian)
Support creating a cluster in an existing AWS VPC (docs) https://github.com/cortexlabs/cortex/pull/1759 https://github.com/cortexlabs/cortex/issues/1142 (deliahu)
Support specifying the GCP network and subnet for the Cortex cluster (docs) https://github.com/cortexlabs/cortex/pull/1752 https://github.com/cortexlabs/cortex/issues/1738 (deliahu)
Support configuring shared memory size (shm) for inter-process communication (docs) https://github.com/cortexlabs/cortex/pull/1756 https://github.com/cortexlabs/cortex/issues/1638 (vishalbollu)

Breaking changes

The local provider has been removed. The best way to test your predictor implementation locally is to import it in a separate Python file and call your __init__() and predict() functions directly. The best way to test your API is to deploy it to a dev/test cluster.
Built-in support for API Gateway has been removed. If you need to create an https endpoint with valid certs, some options are to set up a custom domain or to manually create an API Gateway.
Prediction monitoring has been removed. We are exploring how to build a more powerful and customizable solution for this.
The predict CLI command has been deleted. curl, requests, etc. are the best tools for testing APIs.
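
The local-testing approach described above can be sketched as follows. The PythonPredictor class, its config argument, and the payload shape are hypothetical stand-ins for illustration; in practice you would import your own class from your predictor file.

```python
# Hypothetical predictor used only for illustration; in a real project you
# would import your own class, e.g. `from predictor import PythonPredictor`.
class PythonPredictor:
    def __init__(self, config):
        # Load models or other state here; this toy version stores a threshold.
        self.threshold = config.get("threshold", 0.5)

    def predict(self, payload):
        # Label the request based on the (assumed) "score" field of the payload.
        return "positive" if payload["score"] >= self.threshold else "negative"


def run_local_check():
    # Call __init__() and predict() directly, without deploying an API.
    predictor = PythonPredictor(config={"threshold": 0.7})
    return predictor.predict({"score": 0.9})


if __name__ == "__main__":
    print(run_local_check())
```

A check like this only exercises the Python code; dependencies, container behavior, and networking still need a dev/test cluster.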

Bug fixes

For multi-model APIs, allow model names to share a prefix https://github.com/cortexlabs/cortex/pull/1745 https://github.com/cortexlabs/cortex/issues/1699 (RobertLucian)

Docs

Misc docs improvements (ospillinger)

- Go
Published by deliahu over 5 years ago

cortex - v0.25.0

v0.25.0

New features

Support server-side micro batching for the Python predictor (docs) https://github.com/cortexlabs/cortex/pull/1653 https://github.com/cortexlabs/cortex/issues/1382 (miguelvr)
Add timeout configuration for batch jobs (docs) https://github.com/cortexlabs/cortex/pull/1712 https://github.com/cortexlabs/cortex/issues/1324 (vishalbollu)
Support batch retries (docs) https://github.com/cortexlabs/cortex/pull/1713 https://github.com/cortexlabs/cortex/issues/1540 (lapaniku, vishalbollu)
Support sending failed batches to a dead-letter queue (docs) https://github.com/cortexlabs/cortex/pull/1713 https://github.com/cortexlabs/cortex/issues/1541 (lapaniku, vishalbollu)
Support installing the cortex Python client in predictors https://github.com/cortexlabs/cortex/pull/1709 https://github.com/cortexlabs/cortex/issues/1670 https://github.com/cortexlabs/cortex/issues/1206 (RobertLucian)

Breaking changes

The predictor.model_path field of the realtime api configuration has been moved to predictor.models.path. In addition, for the Python predictor type, predictor.models has been renamed to predictor.multi_model_reloading. Here is the entire API configuration schema.
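
As a rough illustration of the rename, the sketch below rewrites an API configuration that has already been loaded into a Python dictionary. The function name is hypothetical, and it assumes the predictor type is stored as predictor.type with the value "python" for the Python predictor; check the configuration schema before relying on it.

```python
import copy


def migrate_predictor_config(api_config):
    """Return a copy of api_config with the renamed predictor fields applied."""
    migrated = copy.deepcopy(api_config)
    predictor = migrated.get("predictor", {})

    # Python predictor only: predictor.models -> predictor.multi_model_reloading
    # (assumes the type is recorded as predictor.type == "python").
    if predictor.get("type") == "python" and "models" in predictor:
        predictor["multi_model_reloading"] = predictor.pop("models")

    # predictor.model_path -> predictor.models.path
    if "model_path" in predictor:
        predictor.setdefault("models", {})["path"] = predictor.pop("model_path")

    return migrated
```

The input dictionary is left untouched, so the old and new configurations can be compared before anything is redeployed.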

Bug fixes

Misc batch reliability improvements https://github.com/cortexlabs/cortex/pull/1705 https://github.com/cortexlabs/cortex/pull/1718 https://github.com/cortexlabs/cortex/pull/1729 (vishalbollu)

Docs

Reorganize the docs structure https://github.com/cortexlabs/cortex/pull/1696 https://github.com/cortexlabs/cortex/pull/1701 https://github.com/cortexlabs/cortex/pull/1704 https://github.com/cortexlabs/cortex/pull/1719 https://github.com/cortexlabs/cortex/issues/1675 (ospillinger)
Add GCP to the contributing guide https://github.com/cortexlabs/cortex/pull/1720 https://github.com/cortexlabs/cortex/issues/1654 (deliahu)
Add docs for setting up kubectl on GCP https://github.com/cortexlabs/cortex/commit/759b4b144c25cc623e1b385b036f83825d122db7 (deliahu)

Misc

Parse the request body as a string when content type text/plain is specified https://github.com/cortexlabs/cortex/pull/1714 (deliahu)
Support paths to single ONNX files in API configuration https://github.com/cortexlabs/cortex/pull/1711 https://github.com/cortexlabs/cortex/issues/1686 (RobertLucian)
Support deploying public S3 models on GCP, and public GCS models on AWS https://github.com/cortexlabs/cortex/pull/1694 https://github.com/cortexlabs/cortex/issues/1684 (RobertLucian)
Pre-download docker images when creating GCP clusters https://github.com/cortexlabs/cortex/pull/1721 https://github.com/cortexlabs/cortex/issues/1658 (deliahu)
Speed up the validation processes for multi-model APIs https://github.com/cortexlabs/cortex/pull/1690 https://github.com/cortexlabs/cortex/issues/1663 (RobertLucian)

- Go
Published by deliahu over 5 years ago

cortex - v0.24.1

v0.24.1

Bug fixes

Propagate the exit code from the predictor's initialization so that the API status is set to "error" when initialization fails https://github.com/cortexlabs/cortex/issues/1680 https://github.com/cortexlabs/cortex/pull/1691 (RobertLucian)

- Go
Published by deliahu over 5 years ago

cortex - v0.24.0

v0.24.0

New features

Add GCP support: our initial release supports all three predictor types (Python, TensorFlow, ONNX), on CPU or GPU, with live reloading, multi-model caching, and cluster autoscaling https://github.com/cortexlabs/cortex/pull/1655 https://github.com/cortexlabs/cortex/pull/1672 https://github.com/cortexlabs/cortex/pull/1667 https://github.com/cortexlabs/cortex/issues/1661 https://github.com/cortexlabs/cortex/issues/114 https://github.com/cortexlabs/cortex/issues/1600 https://github.com/cortexlabs/cortex/issues/1602 https://github.com/cortexlabs/cortex/issues/1616 https://github.com/cortexlabs/cortex/issues/1624 (RobertLucian, deliahu, vishalbollu)
Add the patch command to the CLI and Python client, which can be used to update an API using only the API configuration (without needing to provide the predictor's Python implementation) https://github.com/cortexlabs/cortex/pull/1651 https://github.com/cortexlabs/cortex/pull/1666 https://github.com/cortexlabs/cortex/issues/1329 (vishalbollu)
Support deploying predictor Python classes from the Python client https://github.com/cortexlabs/cortex/pull/1587 https://github.com/cortexlabs/cortex/issues/1617 (see the tutorial for an example) (vishalbollu)

Breaking changes

The Python client's deploy() function has been renamed to create_api(), and some of the argument names have changed (docs)

Bug fixes

Enable CORS for APIs accessed via API Gateway or load balancer https://github.com/cortexlabs/cortex/pull/1649 https://github.com/cortexlabs/cortex/issues/1234 (RobertLucian, deliahu)
Fix local TensorFlow models when live reloading is enabled https://github.com/cortexlabs/cortex/pull/1668 https://github.com/cortexlabs/cortex/issues/1554 (RobertLucian)
Prevent TensorFlow multi-model caching from attempting to download local models from S3 https://github.com/cortexlabs/cortex/pull/1669 https://github.com/cortexlabs/cortex/issues/1598 (RobertLucian)

Docs

Miscellaneous docs improvements (vishalbollu, ospillinger)

Misc

Improve Python client cross Python version compatibility https://github.com/cortexlabs/cortex/pull/1640 (vishalbollu)
Reinstall TensorFlow and ONNX dependencies when the Python version is overridden https://github.com/cortexlabs/cortex/pull/1652 (vishalbollu)
Terminate container when bootloader script fails https://github.com/cortexlabs/cortex/pull/1639 (vishalbollu)

- Go
Published by deliahu over 5 years ago

cortex - v0.23.0

v0.23.0

New features

Update Python client deploy() to accept a Python dictionary for API configuration (previously, only a file path was supported) (docs) https://github.com/cortexlabs/cortex/pull/1587 (vishalbollu)
Show API deployment history in cortex get API_NAME command https://github.com/cortexlabs/cortex/pull/1544 https://github.com/cortexlabs/cortex/issues/1496 (deliahu)
Add cortex export API_NAME and cortex export API_NAME API_ID commands to export specific and historical API deployments https://github.com/cortexlabs/cortex/pull/1544 https://github.com/cortexlabs/cortex/issues/1497 (deliahu)
Build and push python-predictor-gpu-slim image with different combinations of cuda and cudnn (cuda10.0-cudnn7, cuda10.1-cudnn7, cuda10.1-cudnn8, cuda10.2-cudnn7, cuda10.2-cudnn8, cuda11.0-cudnn8, cuda11.1-cudnn8) (docs) https://github.com/cortexlabs/cortex/pull/1575 https://github.com/cortexlabs/cortex/issues/1574 (deliahu)

Bug fixes

Allow local deployments of public S3 models without requiring AWS credentials https://github.com/cortexlabs/cortex/pull/1589 https://github.com/cortexlabs/cortex/issues/1588 (RobertLucian)

Docs

Add guide for avoiding Docker Hub rate limits https://github.com/cortexlabs/cortex/pull/1576 (RobertLucian, deliahu)
Add guide for self-hosting Cortex's Docker images https://github.com/cortexlabs/cortex/pull/1579 (RobertLucian, deliahu)

Misc

Remove API request maximum payload size limit https://github.com/cortexlabs/cortex/pull/1583 (deliahu)
Switch to Quay docker container registry https://github.com/cortexlabs/cortex/pull/1578 (deliahu, RobertLucian)

- Go
Published by deliahu over 5 years ago

cortex - v0.22.1

v0.22.1

Bug fixes

Set the predictor's working directory to the root Cortex project directory https://github.com/cortexlabs/cortex/pull/1573 https://github.com/cortexlabs/cortex/issues/1572 (deliahu)
Allow max_instances to be updated via cortex cluster configure https://github.com/cortexlabs/cortex/pull/1568 https://github.com/cortexlabs/cortex/issues/1567 (deliahu)
Gracefully stop the serving container when a multi-processed cron throws an exception https://github.com/cortexlabs/cortex/pull/1560 https://github.com/cortexlabs/cortex/issues/1552 (RobertLucian)

Docs

Demonstrate how to make API requests with various payload types (binary, form fields, etc), and show how to access them in predict() https://github.com/cortexlabs/cortex/pull/1566 (docs)
Misc docs improvements https://github.com/cortexlabs/cortex/pull/1551 https://github.com/cortexlabs/cortex/pull/1556 c3dab4045a61703cb1db1d5f95776614252f96c0 https://github.com/cortexlabs/cortex/pull/1557 (deliahu, RobertLucian)

Misc

Build and upload the Python package/CLI to a public S3 bucket https://github.com/cortexlabs/cortex/pull/1562 (vishalbollu)

- Go
Published by deliahu over 5 years ago

cortex - v0.22.0

v0.22.0

New features

Multi-model caching: serve a collection of models that is collectively bigger than what will fit in memory (via LRU cache eviction) (docs) https://github.com/cortexlabs/cortex/pull/1428 https://github.com/cortexlabs/cortex/issues/619 (RobertLucian)
Live reloading: support updating models in running APIs by adding new versions to the model's S3 directory (docs) https://github.com/cortexlabs/cortex/pull/1428 https://github.com/cortexlabs/cortex/issues/1252 (RobertLucian)
Inter-process fairness: distribute requests within an API replica evenly across all processes https://github.com/cortexlabs/cortex/pull/1526 https://github.com/cortexlabs/cortex/issues/839 https://github.com/cortexlabs/cortex/issues/1298 (RobertLucian)
Support requests between APIs within the same cluster (docs) https://github.com/cortexlabs/cortex/pull/1503 https://github.com/cortexlabs/cortex/issues/1241 (deliahu)
Allow overriding of CLI install path and config directory (via $CORTEX_INSTALL_PATH and $CORTEX_CLI_CONFIG_DIR) (docs) https://github.com/cortexlabs/cortex/pull/1521 https://github.com/cortexlabs/cortex/issues/1222 (deliahu)
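
To illustrate the LRU eviction idea behind multi-model caching mentioned above, here is a minimal generic sketch. It is not Cortex's implementation: the class name, the max_models limit, and the load_fn loader are all hypothetical.

```python
from collections import OrderedDict


class LRUModelCache:
    """Keeps at most `max_models` loaded models, evicting the least recently used."""

    def __init__(self, max_models, load_fn):
        self._max_models = max_models
        self._load_fn = load_fn  # hypothetical loader, e.g. downloads and deserializes a model
        self._models = OrderedDict()

    def get(self, name):
        if name in self._models:
            # Cache hit: mark the model as most recently used.
            self._models.move_to_end(name)
            return self._models[name]
        # Cache miss: load the model, then evict the oldest entry if over capacity.
        model = self._load_fn(name)
        self._models[name] = model
        if len(self._models) > self._max_models:
            self._models.popitem(last=False)
        return model

    def cached_names(self):
        # Names ordered from least to most recently used.
        return list(self._models)
```

With a limit of two models, requesting a, b, a, and then c evicts b, because a was used more recently.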

Breaking changes

ONNX model paths in API configuration files must now point to a directory containing a single ONNX file, rather than to the ONNX file itself. For example, model_path: s3://cortex-examples/onnx/yolov5-youtube/yolov5s.onnx becomes model_path: s3://cortex-examples/onnx/yolov5-youtube.
The --env/-e flag in all cortex cluster commands has been renamed to --configure-env/-e, and if not provided, the environment named aws will no longer be configured in the cortex cluster info command.
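
The ONNX path change above amounts to dropping the file name from the path. A small helper along these lines (the function name is hypothetical) could be used when updating many configuration files; it does not verify that the directory contains exactly one ONNX file.

```python
import posixpath


def onnx_model_dir(model_path):
    """Convert a path to an .onnx file into the path of its parent directory."""
    if model_path.lower().endswith(".onnx"):
        # S3 URIs use forward slashes, so posixpath is used on every platform.
        return posixpath.dirname(model_path)
    # Already a directory path (or not an ONNX file): leave unchanged.
    return model_path


if __name__ == "__main__":
    print(onnx_model_dir("s3://cortex-examples/onnx/yolov5-youtube/yolov5s.onnx"))
```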

Bug fixes

Fix intermittent failed requests during rolling updates https://github.com/cortexlabs/cortex/pull/1526 https://github.com/cortexlabs/cortex/issues/814 (RobertLucian)
Prevent CLI environments from getting overwritten when multiple cortex cluster commands are run concurrently https://github.com/cortexlabs/cortex/pull/1520 https://github.com/cortexlabs/cortex/issues/1410 (deliahu)

Docs

Add Python client docs https://github.com/cortexlabs/cortex/pull/1519 https://github.com/cortexlabs/cortex/issues/1502 (deliahu)
Add guide for running in production https://github.com/cortexlabs/cortex/pull/1513 https://github.com/cortexlabs/cortex/issues/1464 https://github.com/cortexlabs/cortex/issues/1257 (deliahu)
Add guide for low-cost clusters https://github.com/cortexlabs/cortex/pull/1514 https://github.com/cortexlabs/cortex/issues/1425 (deliahu)
Add guide for using a REST API Gateway https://github.com/cortexlabs/cortex/pull/1505 https://github.com/cortexlabs/cortex/issues/1228 (deliahu)
Add guide for troubleshooting cortex cluster down failures https://github.com/cortexlabs/cortex/pull/1515 https://github.com/cortexlabs/cortex/issues/1319 (deliahu)

Misc

Stagger Predictor __init__() calls to reduce peak memory consumption https://github.com/cortexlabs/cortex/pull/1543 https://github.com/cortexlabs/cortex/issues/1450 (RobertLucian)
Add --name/-n and --region/-r flags to cortex cluster info, cortex cluster export, and cortex cluster down commands https://github.com/cortexlabs/cortex/pull/1492 https://github.com/cortexlabs/cortex/issues/1363 (RobertLucian)
Rename --env/-e flag to --configure-env/-e in cortex cluster commands and update its behavior https://github.com/cortexlabs/cortex/pull/1533 https://github.com/cortexlabs/cortex/issues/1412 (deliahu)
Disallow ARM-based instances, which are not currently supported https://github.com/cortexlabs/cortex/pull/1536 (deliahu)
Validate AWS vCPU quota is sufficient for up to max_instances instances when running cortex cluster up and cortex cluster configure https://github.com/cortexlabs/cortex/pull/1537 https://github.com/cortexlabs/cortex/issues/1461 (deliahu)

- Go
Published by deliahu over 5 years ago

cortex - v0.21.0

New features

Add Python client: pypi.org/project/cortex https://github.com/cortexlabs/cortex/pull/1449 https://github.com/cortexlabs/cortex/issues/684 (vishalbollu)
Add support for private docker image registries (docs) https://github.com/cortexlabs/cortex/pull/1460 https://github.com/cortexlabs/cortex/issues/1113 (deliahu)

Bug fixes

Fix minor BatchAPI bugs https://github.com/cortexlabs/cortex/pull/1471 https://github.com/cortexlabs/cortex/pull/1468 https://github.com/cortexlabs/cortex/pull/1480 https://github.com/cortexlabs/cortex/issues/1473 (vishalbollu, RobertLucian)
Bypass instance limit check if AWS's API doesn't provide quota information (this was blocking cluster creation in eu-north-1) https://github.com/cortexlabs/cortex/pull/1439 https://github.com/cortexlabs/cortex/issues/1438 (deliahu)

Docs

Add a guide for how to install the CLI on Windows https://github.com/cortexlabs/cortex/pull/1476 https://github.com/cortexlabs/cortex/issues/715 (RobertLucian)

Misc

Change default local port from 8888 to 8890 to avoid port conflicts with Jupyter https://github.com/cortexlabs/cortex/pull/1456 (vishalbollu)
Disallow instance types that aren't supported by NLB https://github.com/cortexlabs/cortex/pull/1436 https://github.com/cortexlabs/cortex/issues/1433 (deliahu)
Add --cluster-aws-key and --cluster-aws-secret flags to cortex cluster configure command https://github.com/cortexlabs/cortex/pull/1404 (deliahu)
Add --output flag to cortex env list command https://github.com/cortexlabs/cortex/pull/1444 (vishalbollu)

- Go
Published by vishalbollu over 5 years ago

cortex - v0.20.0

v0.20.0

New features

Add cortex cluster export command to export all APIs running in a cluster (docs) https://github.com/cortexlabs/cortex/pull/1368 https://github.com/cortexlabs/cortex/issues/1255 (vishalbollu)
Enable users to specify CIDR ranges for the cluster's VPC (docs) https://github.com/cortexlabs/cortex/pull/1388 (vishalbollu)
Support json output for CLI commands (via -o/--output json) https://github.com/cortexlabs/cortex/pull/1365 https://github.com/cortexlabs/cortex/issues/1161 (vishalbollu)
Support the nvidia device driver (nvidia-container-toolkit) when running locally https://github.com/cortexlabs/cortex/pull/1366 https://github.com/cortexlabs/cortex/issues/1223 (vishalbollu)

Breaking changes

The valid values for api_gateway in the cluster configuration file have been changed from enabled/disabled to public/none (to match the values for networking.api_gateway in the API configuration file).
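
A sketch of the value change for a cluster configuration loaded as a Python dictionary. The mapping enabled to public and disabled to none is inferred from the order in which the values are listed above, and the function name is hypothetical.

```python
# Assumed mapping, inferred from the order of the old and new values.
API_GATEWAY_VALUES = {"enabled": "public", "disabled": "none"}


def migrate_api_gateway(cluster_config):
    """Return a copy of cluster_config with api_gateway using the new values."""
    migrated = dict(cluster_config)
    old_value = migrated.get("api_gateway")
    if old_value in API_GATEWAY_VALUES:
        migrated["api_gateway"] = API_GATEWAY_VALUES[old_value]
    return migrated
```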

Bug fixes

Support AWS tags with spaces and valid special characters https://github.com/cortexlabs/cortex/pull/1374 https://github.com/cortexlabs/cortex/pull/1355 https://github.com/cortexlabs/cortex/pull/1380 https://github.com/cortexlabs/cortex/pull/1385 https://github.com/cortexlabs/cortex/issues/1373 (deliahu)
Fix tensor shape validation for the TensorFlow predictor https://github.com/cortexlabs/cortex/pull/1311 https://github.com/cortexlabs/cortex/issues/1310 (RobertLucian)
Allow cortex cluster * commands to be run from within a docker container https://github.com/cortexlabs/cortex/pull/1370 https://github.com/cortexlabs/cortex/issues/1361 https://github.com/cortexlabs/cortex/issues/1325 (deliahu)

New examples

pytorch/question-generator to generate questions given text and the correct answer (uses transformers and spacy) https://github.com/cortexlabs/cortex/pull/1308 (ismaelc)

Docs

Add documentation for how to install a specific version of the CLI https://github.com/cortexlabs/cortex/pull/1386 https://github.com/cortexlabs/cortex/issues/1244 (vishalbollu)
Add sections for overprovisioning and responsiveness to autoscaling docs https://github.com/cortexlabs/cortex/pull/1397 (deliahu)
Add documentation for how to allow IAM users who did not create the cortex cluster to run cortex cluster * commands https://github.com/cortexlabs/cortex/pull/1392 https://github.com/cortexlabs/cortex/issues/1391 (deliahu)
Add guide for setting up kubectl to access the cluster https://github.com/cortexlabs/cortex/pull/1344 https://github.com/cortexlabs/cortex/issues/1343 (RobertLucian)

Misc

Update sources of AWS credentials for cortex cluster * commands, and improve transparency (docs) https://github.com/cortexlabs/cortex/pull/1378 https://github.com/cortexlabs/cortex/issues/1229 (vishalbollu)
Rename cluster api_gateway config values to match API config https://github.com/cortexlabs/cortex/pull/1335 https://github.com/cortexlabs/cortex/issues/1334 (deliahu)
Set the default value for networking.api_gateway in the API configuration to none if api gateway is disabled cluster-wide https://github.com/cortexlabs/cortex/pull/1337 https://github.com/cortexlabs/cortex/issues/1336 (deliahu)
Support c6g and r6g instances https://github.com/cortexlabs/cortex/pull/1332 https://github.com/cortexlabs/cortex/issues/809 (deliahu)
Display autoscaling group activity history when cortex cluster up fails https://github.com/cortexlabs/cortex/pull/1342 https://github.com/cortexlabs/cortex/issues/1340 (deliahu)
Print debug info if cortex cluster up times out https://github.com/cortexlabs/cortex/pull/1396 (deliahu)
Add Inferentia compute statistics to cortex cluster info command https://github.com/cortexlabs/cortex/pull/1354 https://github.com/cortexlabs/cortex/issues/1304 (RobertLucian)
Disable prompts in get-cli.sh if not running interactively https://github.com/cortexlabs/cortex/pull/1372 https://github.com/cortexlabs/cortex/issues/1371 (deliahu)
Update cortex help output https://github.com/cortexlabs/cortex/pull/1398 (deliahu)

- Go
Published by RobertLucian over 5 years ago

cortex - v0.19.0

New features

Support batch APIs (docs) https://github.com/cortexlabs/cortex/pull/1203 https://github.com/cortexlabs/cortex/issues/523 (vishalbollu)
Support traffic splitting (enables A/B testing, multi-armed bandit, etc) (docs) https://github.com/cortexlabs/cortex/pull/1213 https://github.com/cortexlabs/cortex/pull/1270 https://github.com/cortexlabs/cortex/issues/1132 https://github.com/cortexlabs/cortex/issues/275 https://github.com/cortexlabs/cortex/issues/1089 (tthebst)
Support server-side request batching for the TensorFlow Predictor (docs) https://github.com/cortexlabs/cortex/pull/1193 https://github.com/cortexlabs/cortex/issues/1060 (RobertLucian)
Add post_predict() method to Predictor interface (runs after the response has been sent) (docs) https://github.com/cortexlabs/cortex/pull/1237 https://github.com/cortexlabs/cortex/issues/954 (RobertLucian)
Support disabling API Gateway cluster-wide (docs) https://github.com/cortexlabs/cortex/pull/1259 https://github.com/cortexlabs/cortex/issues/1198 (deliahu)
Support different CUDA versions for the slim Python Predictor image (docs) https://github.com/cortexlabs/cortex/pull/1263 https://github.com/cortexlabs/cortex/issues/923 https://github.com/cortexlabs/cortex/issues/1254 (RobertLucian)
Add additional widgets to the CloudWatch Dashboard (avg in-flight requests per replica, active replicas) (docs) https://github.com/cortexlabs/cortex/pull/1181 (RobertLucian)

Breaking changes

kind is now a required top-level field for all API configurations. Existing APIs should add kind: RealtimeAPI. This release adds support for kind: BatchAPI and kind: TrafficSplitter.
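
For existing configuration files, the change amounts to adding kind: RealtimeAPI to every API that lacks a kind. A minimal sketch, assuming the API configurations have been loaded into a list of Python dictionaries (the function name is hypothetical):

```python
def add_default_kind(api_configs):
    """Return new API configs where any entry without `kind` gets RealtimeAPI."""
    return [
        cfg if "kind" in cfg else {"kind": "RealtimeAPI", **cfg}
        for cfg in api_configs
    ]
```

Entries that already declare a kind, such as BatchAPI or TrafficSplitter, are passed through unchanged.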

Bug fixes

Fix python_path config field https://github.com/cortexlabs/cortex/pull/1202 (deliahu)
Fix local TensorFlow deploy from parent directory https://github.com/cortexlabs/cortex/pull/1274 (deliahu)
Improve error response for invalid payloads https://github.com/cortexlabs/cortex/pull/1212 https://github.com/cortexlabs/cortex/issues/1208 (RobertLucian)

New examples

onnx/yolov5-youtube https://github.com/cortexlabs/cortex/pull/1201 (dsuess)
Update PyTorch text generator example to use Hugging Face transformers GPT-2 model https://github.com/cortexlabs/cortex/pull/1177 (ospillinger)

Docs

Update tutorial to use the pytorch text-generator example https://github.com/cortexlabs/cortex/pull/1278 https://github.com/cortexlabs/cortex/issues/1256 (deliahu)
Improve instructions for updating cluster without downtime https://github.com/cortexlabs/cortex/pull/1261 (deliahu)
Mention API Gateway timeout in 404/503 API responses guide https://github.com/cortexlabs/cortex/pull/1264 https://github.com/cortexlabs/cortex/issues/1225 (deliahu)

Misc

Set tags on log groups https://github.com/cortexlabs/cortex/pull/1164 https://github.com/cortexlabs/cortex/issues/1078 (tthebst)
Display API metrics in the CLI by API ID (rather than by API name) https://github.com/cortexlabs/cortex/pull/1216 (vishalbollu)
Fix recursive error message for deploy/delete CLI commands https://github.com/cortexlabs/cortex/pull/1247 https://github.com/cortexlabs/cortex/issues/1218 (RobertLucian)
Add shell completion to .zshrc file during CLI installation https://github.com/cortexlabs/cortex/pull/1265 https://github.com/cortexlabs/cortex/issues/1221 (deliahu)
Handle OOM error when project files are too large https://github.com/cortexlabs/cortex/pull/1217 (RobertLucian)
Display image pull errors https://github.com/cortexlabs/cortex/pull/1167 https://github.com/cortexlabs/cortex/issues/955 (deliahu)
Display local Docker image pull error when out of space https://github.com/cortexlabs/cortex/pull/1238 https://github.com/cortexlabs/cortex/issues/1236 (zouyee)

- Go
Published by vishalbollu almost 6 years ago

cortex - v0.18.1

Bug fixes

Fix dynamic axes for ONNX models https://github.com/cortexlabs/cortex/pull/1187 https://github.com/cortexlabs/cortex/issues/1186 (RobertLucian)
Fix memory node capacity calculation for multi-api configuration files https://github.com/cortexlabs/cortex/pull/1185 (deliahu)
Check cluster-name tag when choosing load balancer for VPC Link integration https://github.com/cortexlabs/cortex/pull/1173 (deliahu)

Misc

Delete API Gateway if cluster up fails https://github.com/cortexlabs/cortex/pull/1172 (deliahu)
Move image version verification from serve.py to run.sh https://github.com/cortexlabs/cortex/pull/1180 https://github.com/cortexlabs/cortex/pull/1183 (vishalbollu)
Add retries for resource tagging during cluster up https://github.com/cortexlabs/cortex/pull/1188 (deliahu)
Use info log level when TensorFlow model is being loaded https://github.com/cortexlabs/cortex/pull/1171 (RobertLucian)
Increase max number of processes per API replica to 100 https://github.com/cortexlabs/cortex/pull/1166 (RobertLucian)
Allow empty cluster config https://github.com/cortexlabs/cortex/pull/1179 (deliahu)

- Go
Published by deliahu almost 6 years ago

cortex - v0.18.0

New features

Support Inferentia instances https://github.com/cortexlabs/cortex/pull/1119 https://github.com/cortexlabs/cortex/issues/654 (RobertLucian)
Automatically provision HTTPS API Gateway endpoints for Cortex APIs https://github.com/cortexlabs/cortex/pull/1108 https://github.com/cortexlabs/cortex/issues/1077 (tthebst)
Support multi-model endpoints for TensorFlow and ONNX predictors https://github.com/cortexlabs/cortex/pull/1107 https://github.com/cortexlabs/cortex/issues/890 (RobertLucian)
Support local Docker images in the local environment https://github.com/cortexlabs/cortex/pull/1114 https://github.com/cortexlabs/cortex/issues/1094 (RobertLucian)
Support replica parallelism fields (processes_per_replica and threads_per_process) in the local environment https://github.com/cortexlabs/cortex/pull/1158 https://github.com/cortexlabs/cortex/issues/960 https://github.com/cortexlabs/cortex/issues/1090 (RobertLucian)
Support a .env file to export environment variables in the API container https://github.com/cortexlabs/cortex/pull/1154 https://github.com/cortexlabs/cortex/issues/1147 (RobertLucian, spentaur)

Breaking changes

autoscaling.workers_per_replica and autoscaling.threads_per_worker have been moved/renamed to predictor.processes_per_replica and predictor.threads_per_process (see API configuration docs)
endpoint and local_port have been moved to a new sub-field called networking (see API configuration docs)
model has been renamed to model_path in TensorFlow and ONNX predictors (see API configuration docs)
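
The three renames above can be applied mechanically to an API configuration loaded as a Python dictionary. This is a sketch under assumptions: the function name is hypothetical, endpoint and local_port are assumed to have been top-level fields, and the predictor type is assumed to be stored as predictor.type with the values "tensorflow" or "onnx".

```python
import copy


def migrate_api_config(api_config):
    """Return a copy of api_config with the moved and renamed fields applied."""
    cfg = copy.deepcopy(api_config)
    predictor = cfg.setdefault("predictor", {})
    autoscaling = cfg.get("autoscaling", {})

    # autoscaling.workers_per_replica -> predictor.processes_per_replica
    # autoscaling.threads_per_worker  -> predictor.threads_per_process
    for old, new in (
        ("workers_per_replica", "processes_per_replica"),
        ("threads_per_worker", "threads_per_process"),
    ):
        if old in autoscaling:
            predictor[new] = autoscaling.pop(old)

    # endpoint and local_port move under the new networking sub-field
    # (assumed here to have been top-level fields).
    for field in ("endpoint", "local_port"):
        if field in cfg:
            cfg.setdefault("networking", {})[field] = cfg.pop(field)

    # model -> model_path, for TensorFlow and ONNX predictors only.
    if predictor.get("type") in ("tensorflow", "onnx") and "model" in predictor:
        predictor["model_path"] = predictor.pop("model")

    return cfg
```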

Bug fixes

Prevent GPU overprovisioning during autoscaling https://github.com/cortexlabs/cortex/pull/1111 https://github.com/cortexlabs/cortex/issues/1085 (vishalbollu)

New examples

tensorflow/image-classifier-resnet50 and pytorch/image-classifier-resnet50 to demonstrate using Inferentia https://github.com/cortexlabs/cortex/pull/1119 (RobertLucian)
pytorch/multi-model-text-analyzer, tensorflow/multi-model-classifier, and onnx/multi-model-classifier to demonstrate multi-model APIs https://github.com/cortexlabs/cortex/pull/1107 (RobertLucian)

New guides

Multi-model endpoints using TensorFlow and ONNX predictors https://github.com/cortexlabs/cortex/pull/1107 (RobertLucian)

Docs

Add API architecture diagram https://github.com/cortexlabs/cortex/pull/1126 (deliahu)
Add documentation for configuring the CLI on a new machine https://github.com/cortexlabs/cortex/pull/1127 (deliahu, javithe7)

Misc

Call predictor __init__() from the request's threadpool to avoid multithreading issues in some ML frameworks when using 1 thread https://github.com/cortexlabs/cortex/pull/1146 (deliahu)
Allow changing an API's endpoint to not trigger a rolling replica update https://github.com/cortexlabs/cortex/pull/1155 https://github.com/cortexlabs/cortex/issues/1116 (deliahu)
Set the default shell to bash in Predictor Dockerfiles (which simplifies using them as base images for custom-built images) https://github.com/cortexlabs/cortex/pull/1104 https://github.com/cortexlabs/cortex/issues/1086 (RobertLucian)
Move endpoint and local_port to networking API config https://github.com/cortexlabs/cortex/pull/1151 https://github.com/cortexlabs/cortex/issues/1091 (deliahu)
Rename model to model_path in API config https://github.com/cortexlabs/cortex/pull/1150 https://github.com/cortexlabs/cortex/issues/1115 (deliahu)
Use cluster name for CloudWatch metrics namespace https://github.com/cortexlabs/cortex/pull/1138 (deliahu)
Misc UI improvements https://github.com/cortexlabs/cortex/pull/1159 https://github.com/cortexlabs/cortex/pull/1084 https://github.com/cortexlabs/cortex/pull/1152 https://github.com/cortexlabs/cortex/pull/1136 https://github.com/cortexlabs/cortex/pull/1128 https://github.com/cortexlabs/cortex/commit/a52f0a0a7871253c8fdd13eac7116cf4b2e961c1 https://github.com/cortexlabs/cortex/pull/1096 (deliahu)

- Go
Published by deliahu almost 6 years ago

cortex - v0.17.1

Bug fixes

Fix bug which caused installation of packages from conda-packages.txt to get stuck when running Cortex locally https://github.com/cortexlabs/cortex/pull/1101 https://github.com/cortexlabs/cortex/issues/1100 (dsuess)

New guides

Multi-model endpoints using the Python predictor https://github.com/cortexlabs/cortex/pull/1081 (deliahu)

Docs

Improve tutorial https://github.com/cortexlabs/cortex/pull/1099 https://github.com/cortexlabs/cortex/issues/1098 (ssami)
Misc docs improvements https://github.com/cortexlabs/cortex/pull/1095 https://github.com/cortexlabs/cortex/commit/e15cc62ddf49af3d41259f56a21adc6e8dabcbc3 https://github.com/cortexlabs/cortex/commit/8023ff98241b85e1d05770c52981721fb8db735f (deliahu, vishalbollu)

- Go
Published by deliahu about 6 years ago

cortex - v0.17.0

New features

Support arbitrary API request payload content types, including raw bytes and form fields (not just JSON) https://github.com/cortexlabs/cortex/pull/1062, https://github.com/cortexlabs/cortex/issues/332, https://github.com/cortexlabs/cortex/issues/917 (deliahu)
Support custom SSL certificates for the API load balancer https://github.com/cortexlabs/cortex/pull/1069, https://github.com/cortexlabs/cortex/issues/326, https://github.com/cortexlabs/cortex/issues/1066 (vishalbollu)
Add a CloudWatch dashboard to show metrics for each running API https://github.com/cortexlabs/cortex/pull/1054, https://github.com/cortexlabs/cortex/issues/855 (tthebst)
Allow for custom tagging of AWS resources created by Cortex (and add the cortex.dev/cluster-name tag by default) https://github.com/cortexlabs/cortex/pull/1031, https://github.com/cortexlabs/cortex/issues/854, https://github.com/cortexlabs/cortex/issues/856 (vishalbollu)
Expose request query parameters to the predictor's predict() function https://github.com/cortexlabs/cortex/pull/1062, https://github.com/cortexlabs/cortex/issues/546 (deliahu)
Expose request headers to the predictor's predict() function https://github.com/cortexlabs/cortex/pull/1062 (deliahu)
Allow users to change the Python version via conda-packages.txt https://github.com/cortexlabs/cortex/pull/1052, https://github.com/cortexlabs/cortex/issues/1051 (RobertLucian)
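
As a rough illustration of the payload, query parameter, and header features listed above, here is a minimal sketch of a Python predictor. It assumes predict() receives the arguments payload, query_params, and headers; the class name, argument names, the "greeting" config key, and the "x-request-id" header are illustrative assumptions, so check the predictor documentation for this release for the exact signature.

```python
class PythonPredictor:
    """Hypothetical predictor; class and argument names are assumptions."""

    def __init__(self, config):
        # config stands in for the predictor configuration of the API
        self.default_greeting = config.get("greeting", "hello")

    def predict(self, payload, query_params, headers):
        # The payload may be parsed JSON (a dict) or raw bytes,
        # depending on the content type of the request.
        if isinstance(payload, bytes):
            name = payload.decode("utf-8")
        else:
            name = payload.get("name", "world")

        # Query parameters and headers are treated as dict-like objects here.
        greeting = query_params.get("greeting", self.default_greeting)
        request_id = headers.get("x-request-id", "unknown")
        return {"message": f"{greeting}, {name}", "request_id": request_id}
```

The sketch branches on the payload type because a single API may receive both JSON and raw-byte requests.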

Bug fixes

Fix bug which caused a validation error when running TensorFlow or ONNX locally with locally saved models https://github.com/cortexlabs/cortex/pull/1075 (RobertLucian)
Enable tty on local docker API containers to avoid strange characters showing up in cortex logs https://github.com/cortexlabs/cortex/pull/1067 (deliahu)
Allow cluster's min_instances and max_instances to be updated at the same time https://github.com/cortexlabs/cortex/pull/1050, https://github.com/cortexlabs/cortex/issues/840 (deliahu)

New examples

Update pytorch/sentiment-analyzer to use the transformers library https://github.com/cortexlabs/cortex/pull/1064 (deliahu)

New guides

Viewing API metrics on the CloudWatch dashboard (deliahu)

Docs

Add architecture diagram https://github.com/cortexlabs/cortex/pull/1042, https://github.com/cortexlabs/cortex/issues/1013 (deliahu)
Document how to install packages from private PyPI indexes https://github.com/cortexlabs/cortex/pull/1072 (RobertLucian)

Misc

Add zsh completion https://github.com/cortexlabs/cortex/pull/1024, https://github.com/cortexlabs/cortex/issues/1020 (deliahu)
Rename cortex cluster update to cortex cluster configure https://github.com/cortexlabs/cortex/pull/1035, https://github.com/cortexlabs/cortex/issues/887 (zouyee)
Add instance and pricing information to cortex cluster info output https://github.com/cortexlabs/cortex/pull/1053, https://github.com/cortexlabs/cortex/issues/835, https://github.com/cortexlabs/cortex/issues/935 (deliahu)
Rename tracker to monitoring https://github.com/cortexlabs/cortex/pull/1041, https://github.com/cortexlabs/cortex/issues/869 (deliahu)
Change the default cortex region to us-east-1 https://github.com/cortexlabs/cortex/pull/1063 (deliahu)
Disable cluster logging https://github.com/cortexlabs/cortex/pull/1029, https://github.com/cortexlabs/cortex/issues/888 (deliahu)
Add git to API images https://github.com/cortexlabs/cortex/pull/1068 (RobertLucian)
Add upper bound validation for max_replica_concurrency https://github.com/cortexlabs/cortex/pull/1025 (zouyee)
Misc UI/UX improvements https://github.com/cortexlabs/cortex/pull/1032, https://github.com/cortexlabs/cortex/pull/1023, https://github.com/cortexlabs/cortex/pull/1033, https://github.com/cortexlabs/cortex/pull/1036, https://github.com/cortexlabs/cortex/pull/1045, https://github.com/cortexlabs/cortex/pull/1047, https://github.com/cortexlabs/cortex/pull/1049, https://github.com/cortexlabs/cortex/pull/1044, https://github.com/cortexlabs/cortex/commit/93032e2e54c9a1876264c98b724fc29f3faf5a4d, https://github.com/cortexlabs/cortex/pull/1065, https://github.com/cortexlabs/cortex/issues/726, https://github.com/cortexlabs/cortex/issues/1048, https://github.com/cortexlabs/cortex/issues/894, https://github.com/cortexlabs/cortex/issues/1043, https://github.com/cortexlabs/cortex/issues/1028, https://github.com/cortexlabs/cortex/issues/933, https://github.com/cortexlabs/cortex/issues/1027, https://github.com/cortexlabs/cortex/issues/934, https://github.com/cortexlabs/cortex/issues/1026 (deliahu, vishalbollu)

Breaking changes

cortex cluster update has been renamed to cortex cluster configure (so it won't be misinterpreted as updating the Cortex version of the cluster)
the tracker field in API configuration has been renamed to monitoring (to be consistent with the other field names)

- Go
Published by deliahu about 6 years ago

cortex - v0.16.1

Bug fixes

Read and validate TensorFlow and ONNX models from buckets in any region https://github.com/cortexlabs/cortex/pull/1059 (vishalbollu)

- Go
Published by deliahu about 6 years ago

cortex - v0.16.0

New features

Support deploying APIs locally https://github.com/cortexlabs/cortex/pull/973 https://github.com/cortexlabs/cortex/issues/109 (vishalbollu)
Enable private networking: private subnets for instances, internal API load balancer, and internal operator load balancer https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/965 https://github.com/cortexlabs/cortex/issues/832 https://github.com/cortexlabs/cortex/issues/964 (deliahu)
Support installing system packages via dependencies.sh https://github.com/cortexlabs/cortex/pull/880 https://github.com/cortexlabs/cortex/issues/852 (RobertLucian)
Support installing conda packages via conda-packages.txt https://github.com/cortexlabs/cortex/pull/880 https://github.com/cortexlabs/cortex/issues/844 (RobertLucian)
Allow for spot instances to be used with a single instance type https://github.com/cortexlabs/cortex/pull/979 https://github.com/cortexlabs/cortex/issues/886 (RobertLucian)
Support specifying serving images in API configuration (on a per-API basis) https://github.com/cortexlabs/cortex/pull/948 https://github.com/cortexlabs/cortex/issues/900 (RobertLucian)
Add cortex commands to list and remove CLI environments https://github.com/cortexlabs/cortex/pull/973 https://github.com/cortexlabs/cortex/issues/730 (deliahu)
Support bytes and starlette.responses.Response response types from predict() https://github.com/cortexlabs/cortex/pull/915 https://github.com/cortexlabs/cortex/issues/913 (RobertLucian)
Add slim predictor base images https://github.com/cortexlabs/cortex/pull/992 https://github.com/cortexlabs/cortex/issues/781 (deliahu)
Support configuring instance volume type and provisioned IOPS https://github.com/cortexlabs/cortex/pull/982 https://github.com/cortexlabs/cortex/issues/592 (tthebst)
Support highly available NAT Gateway https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/963 (deliahu)
Add --yes flag to skip prompts on cluster CLI commands https://github.com/cortexlabs/cortex/pull/980 https://github.com/cortexlabs/cortex/issues/929 (deliahu)

New Examples

Bart summarizer (using PyTorch) https://github.com/cortexlabs/cortex/pull/907 (ismaelc)
Named entity recognizer (using spacy) https://github.com/cortexlabs/cortex/pull/924 (aced125)
Lite version of the license plate reader https://github.com/cortexlabs/cortex/pull/994 (RobertLucian)

Docs

Add environment documentation (deliahu)

Misc

Use rolling updates for daemonsets when running cortex cluster update https://github.com/cortexlabs/cortex/pull/972 https://github.com/cortexlabs/cortex/issues/630 (tthebst)
Switch from Classic Elastic Load Balancer to Network Load Balancer https://github.com/cortexlabs/cortex/pull/978 https://github.com/cortexlabs/cortex/issues/966 (deliahu)
Show the original error message when encountering "invalid AWS credentials" https://github.com/cortexlabs/cortex/pull/918 (deliahu)
Log unexpected TensorFlow Serving gRPC errors https://github.com/cortexlabs/cortex/pull/949 (deliahu)

Breaking Changes

Previously, custom serving images were configured in the cluster configuration file (e.g. cluster.yaml would have a line for image_python_serve: my-repo/python-serve:latest). Now, custom images are specified inside the API configuration (e.g. cortex.yaml should have image: my-repo/python-serve:latest in the predictor section of your API configuration). Here's the full documentation for API configuration.
The names of the serving base images have been updated to be more descriptive, and "slim" images have been added (they are more appropriate to use as base images when building custom images). Here is the full documentation for custom Docker images.
The debug query parameter to APIs (which caused the input and output to predict() to be logged) has been removed (https://github.com/cortexlabs/cortex/pull/985)

- Go
Published by deliahu about 6 years ago

cortex - v0.15.1

Bug fixes

Fix admin check for root AWS users https://github.com/cortexlabs/cortex/pull/940 (deliahu)
Fix CORS https://github.com/cortexlabs/cortex/pull/942 (vishalbollu)

Misc

Print panic stack traces https://github.com/cortexlabs/cortex/pull/921 (deliahu)

- Go
Published by deliahu about 6 years ago

cortex - v0.15.0

Bug fixes

Improve availability zone selection and validation https://github.com/cortexlabs/cortex/pull/885 https://github.com/cortexlabs/cortex/pull/891 (deliahu)
Validate predictor implementation using getfullargspec() https://github.com/cortexlabs/cortex/pull/902 (vishalbollu)
Do not remove cluster configuration cache if refresh fails https://github.com/cortexlabs/cortex/pull/893 (vishalbollu)
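
The getfullargspec() validation mentioned above can be sketched with the standard library's inspect module. This is only an illustration of the technique, not the code from the linked pull request; the function name validate_predict_signature and the default allowed_args value are hypothetical.

```python
import inspect


def validate_predict_signature(predictor_class, allowed_args=("payload",)):
    """Return a list of problems found in predictor_class.predict()."""
    predict_fn = getattr(predictor_class, "predict", None)
    if predict_fn is None or not callable(predict_fn):
        return ["predict() is not defined"]

    # getfullargspec() reports the positional argument names, including "self"
    spec = inspect.getfullargspec(predict_fn)
    problems = []
    if not spec.args or spec.args[0] != "self":
        problems.append('first argument of predict() must be "self"')
    for arg in spec.args[1:]:
        if arg not in allowed_args:
            problems.append(f'unexpected argument "{arg}" in predict()')
    return problems


class GoodPredictor:
    def predict(self, payload):
        return payload


class BadPredictor:
    def predict(self, sample):
        return sample


print(validate_predict_signature(GoodPredictor))
print(validate_predict_signature(BadPredictor))
```

Inspecting the signature before deployment lets a misnamed argument surface as a validation message rather than as a runtime error on the first request.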

New Examples

Keras autoencoder for filtering out the noise from text documents https://github.com/cortexlabs/cortex/pull/834 (RobertLucian)

Misc

Check cluster status before executing cluster commands https://github.com/cortexlabs/cortex/pull/881 https://github.com/cortexlabs/cortex/pull/879 https://github.com/cortexlabs/cortex/pull/892 (vishalbollu)
Check for AWS Administrator IAM access in cluster up and cluster down commands https://github.com/cortexlabs/cortex/pull/878 (deliahu)
Wait for cloudformation stacks to delete during cluster down https://github.com/cortexlabs/cortex/pull/876 (vishalbollu)
Verify cortex operator url during cortex configure https://github.com/cortexlabs/cortex/pull/877 (vishalbollu)
Allow blank bucket values in cluster.yaml https://github.com/cortexlabs/cortex/pull/875 (vishalbollu)
Improve various error messages https://github.com/cortexlabs/cortex/pull/895 https://github.com/cortexlabs/cortex/pull/896 https://github.com/cortexlabs/cortex/pull/897 https://github.com/cortexlabs/cortex/pull/899 https://github.com/cortexlabs/cortex/pull/905 (deliahu)
Improve documentation https://github.com/cortexlabs/cortex/pull/861 https://github.com/cortexlabs/cortex/pull/853 https://github.com/cortexlabs/cortex/pull/851 https://github.com/cortexlabs/cortex/pull/868 https://github.com/cortexlabs/cortex/pull/870 https://github.com/cortexlabs/cortex/pull/871, https://github.com/cortexlabs/cortex/pull/872, https://github.com/cortexlabs/cortex/commit/ac481b93e2254d4753201909fdd8063df3484661 https://github.com/cortexlabs/cortex/commit/3ad39039ca366e240a83779845bae6d3f0d4b853 (deliahu, vishalbollu, RobertLucian)

Breaking changes

Remove json_tricks for encoding API responses (responses from predict() must now be json serializable) https://github.com/cortexlabs/cortex/pull/908 (vishalbollu)
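
Since responses must now be JSON serializable, a predictor may need to convert its output before returning it. The helper below is a minimal sketch of one way to do that; to_json_serializable is a hypothetical name, and the tolist() branch assumes array-like objects (for example NumPy arrays) that expose that method.

```python
import json


def to_json_serializable(value):
    """Recursively convert common non-JSON types into JSON-friendly ones."""
    if isinstance(value, dict):
        return {str(k): to_json_serializable(v) for k, v in value.items()}
    if isinstance(value, (list, tuple, set)):
        return [to_json_serializable(v) for v in value]
    if isinstance(value, bytes):
        return value.decode("utf-8")
    # Assumption: array-like objects (e.g. NumPy arrays) expose tolist()
    if hasattr(value, "tolist"):
        return to_json_serializable(value.tolist())
    return value


prediction = {"scores": (0.1, 0.9), "label": b"cat"}
print(json.dumps(to_json_serializable(prediction)))
```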

- Go
Published by deliahu about 6 years ago

cortex - v0.14.1

Bug fixes

Fix JSON parsing before it gets passed to predict() https://github.com/cortexlabs/cortex/pull/865 (vishalbollu)
Support setup.py packages in requirements.txt https://github.com/cortexlabs/cortex/pull/864 (deliahu)
Run TensorFlow Predictor's model validations in the region that contains the bucket https://github.com/cortexlabs/cortex/pull/866 (deliahu)

Misc

Check that cluster image versions match the CLI version https://github.com/cortexlabs/cortex/pull/862 (deliahu)
Add Cython to Python predictor image https://github.com/cortexlabs/cortex/commit/4a463cab809e67ae257b2ac5a41ee94a7cdbce93 (deliahu)

- Go
Published by vishalbollu about 6 years ago

cortex - v0.14.0

New features

Support request-based autoscaling https://github.com/cortexlabs/cortex/pull/815 https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/573 (vishalbollu, deliahu)
Support fine-grained configuration for autoscaling algorithm behavior https://github.com/cortexlabs/cortex/pull/815 (deliahu)
Support configurable in-replica parallelism (i.e. workers, threads) https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/590 (vishalbollu, deliahu)
Support configurable request queue length https://github.com/cortexlabs/cortex/pull/838 https://github.com/cortexlabs/cortex/issues/646 (vishalbollu)
Support .cortexignore file to exclude files/directories from Cortex project zip https://github.com/cortexlabs/cortex/pull/800 https://github.com/cortexlabs/cortex/issues/723 (wingkwong)

Bug fixes

Ensure previous logs are never shown after showing newer ones https://github.com/cortexlabs/cortex/pull/792 (deliahu)
Skip service quota validation in unsupported regions https://github.com/cortexlabs/cortex/pull/825 (deliahu)
Fix prediction metrics when specifying tracker.key https://github.com/cortexlabs/cortex/pull/793 (deliahu)

New Examples

Real-Time License Plate Detector Example Project (YOLOv3, CRAFT, CRNN) https://github.com/cortexlabs/cortex/pull/803 (RobertLucian)

Misc

Show a warning if AWS session token is detected https://github.com/cortexlabs/cortex/pull/842 (chrisranderson, vishalbollu)
Disable NAT gateway https://github.com/cortexlabs/cortex/pull/808 (deliahu)
Add debug information to cluster error messages https://github.com/cortexlabs/cortex/commit/5a1a2bcba5d5634eb3fd8fd07c64f9556df386e0 https://github.com/cortexlabs/cortex/pull/850 (deliahu)
Add cluster costs to README https://github.com/cortexlabs/cortex/pull/807 https://github.com/cortexlabs/cortex/issues/806 (bcjordan, deliahu)
Document which system packages are installed in Docker images https://github.com/cortexlabs/cortex/pull/847 https://github.com/cortexlabs/cortex/issues/822 (deliahu)
Update pytorch examples to use GPU https://github.com/cortexlabs/cortex/pull/849 (vishalbollu)
Install libsndfile1 in API Docker images https://github.com/cortexlabs/cortex/pull/826 (deliahu)

Breaking API Changes

min_replicas, max_replicas, and init_replicas have been moved from the compute configuration key to autoscaling
max_surge and max_unavailable have been moved from the compute configuration key to update_strategy
target_cpu_utilization has been removed in favor of the request-based scaling configuration parameters (see the autoscaling docs for a detailed explanation of the new parameters)
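
The key moves described above can be sketched as a small migration over an API configuration that has been loaded into a dict. This is a hedged illustration: migrate_api_config is a hypothetical helper, and it assumes that autoscaling and update_strategy sit at the same level as compute in the API configuration. It drops target_cpu_utilization without choosing replacement values, since those depend on the new request-based parameters.

```python
import copy

AUTOSCALING_KEYS = ("min_replicas", "max_replicas", "init_replicas")
UPDATE_STRATEGY_KEYS = ("max_surge", "max_unavailable")


def migrate_api_config(api):
    """Return a copy of api with the moved keys relocated out of compute."""
    migrated = copy.deepcopy(api)
    compute = migrated.get("compute", {})

    for key in AUTOSCALING_KEYS:
        if key in compute:
            migrated.setdefault("autoscaling", {})[key] = compute.pop(key)

    for key in UPDATE_STRATEGY_KEYS:
        if key in compute:
            migrated.setdefault("update_strategy", {})[key] = compute.pop(key)

    # Removed field: there is no one-to-one replacement, so it is dropped here
    compute.pop("target_cpu_utilization", None)
    return migrated


old_config = {
    "name": "iris",
    "compute": {"cpu": 1, "min_replicas": 1, "max_replicas": 5, "max_surge": "25%"},
}
print(migrate_api_config(old_config))
```

The input is deep-copied so the original configuration stays available for comparison.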

- Go
Published by deliahu about 6 years ago

cortex - v0.13.1

Bug fixes

Fix cortex cluster update when using spot instances with no on-demand backup nodegroup https://github.com/cortexlabs/cortex/pull/787 (vishalbollu)

Misc

Set locale to en_US.UTF-8 https://github.com/cortexlabs/cortex/pull/784 (deliahu, RobertLucian)
Cause replica to error when pip install fails https://github.com/cortexlabs/cortex/commit/394862b0f9ce303f71ce9d18996c13350d47e2c5 (deliahu)
Query EKS price from AWS Pricing API https://github.com/cortexlabs/cortex/pull/783 (deliahu)
Assert API version before inspecting args in python https://github.com/cortexlabs/cortex/pull/789 (vishalbollu)
Improve healthcheck https://github.com/cortexlabs/cortex/pull/788 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex - v0.13.0

New features

Support on-demand instance backup when spot instances are not available https://github.com/cortexlabs/cortex/pull/745 https://github.com/cortexlabs/cortex/issues/629 (vishalbollu)
Remove kind: deployment from API configuration https://github.com/cortexlabs/cortex/pull/759 (deliahu, vishalbollu)
Add cortex refresh <api_name> command https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/758 (deliahu)
Update cortex delete <api_name> command https://github.com/cortexlabs/cortex/pull/759 (deliahu)
Add configuration for rolling update strategy (max_surge and max_unavailable) https://github.com/cortexlabs/cortex/pull/763 (deliahu)
Support programmatic CLI configuration via command line flags https://github.com/cortexlabs/cortex/pull/764 https://github.com/cortexlabs/cortex/issues/729 (deliahu)
Support small instance types https://github.com/cortexlabs/cortex/pull/720 (deliahu)
Add env flag to cluster up and cluster update commands https://github.com/cortexlabs/cortex/pull/731 (deliahu)

Bug fixes

Limit cluster growth rate to avoid Kubernetes API server crashes https://github.com/cortexlabs/cortex/pull/769 (vishalbollu)
Use configured max price for filtering spot instance distribution https://github.com/cortexlabs/cortex/pull/746 https://github.com/cortexlabs/cortex/issues/719 (vishalbollu)
Disallow nano and micro instances https://github.com/cortexlabs/cortex/commit/84f09375a7eb9d817e7be40e246cb190b9eb8393 https://github.com/cortexlabs/cortex/issues/755 (deliahu)
Fix pod status calculation to classify successfully recovered replicas as ready https://github.com/cortexlabs/cortex/commit/c5d97eb2156592ebaa73c3481404bf4a496dbcf1 https://github.com/cortexlabs/cortex/issues/738 (deliahu)

New Examples

Object detection in images with R-CNN https://github.com/cortexlabs/cortex/pull/754 (ArkinDharawat)
Fastai https://github.com/cortexlabs/cortex/pull/725 (caleb-kaiser)

Misc

Add total cluster price to installation confirmation message https://github.com/cortexlabs/cortex/pull/714 https://github.com/cortexlabs/cortex/pull/775 https://github.com/cortexlabs/cortex/issues/713 (deliahu, vishalbollu)
Prompt before attempting to zip large files, many files, or large total folder size https://github.com/cortexlabs/cortex/pull/752 https://github.com/cortexlabs/cortex/pull/767 https://github.com/cortexlabs/cortex/issues/721 https://github.com/cortexlabs/cortex/issues/722 (vishalbollu, deliahu)
Reduce cortex operator kubernetes API calls https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/672 (deliahu)
Reduce fluentd kubernetes API calls https://github.com/cortexlabs/cortex/pull/759 https://github.com/cortexlabs/cortex/issues/672 (vishalbollu)
Add EKS control plane logging https://github.com/cortexlabs/cortex/pull/753 https://github.com/cortexlabs/cortex/issues/717 (vishalbollu)
Enforce that bucket and cluster regions match https://github.com/cortexlabs/cortex/pull/777 (deliahu)
Update ONNX runtime to 1.1.0 https://github.com/cortexlabs/cortex/commit/1e74ab7168534183710ab1f6dfe96666e08fb323 https://github.com/cortexlabs/cortex/commit/64f95b7598986a6937b16f0c21d820de4d353221 https://github.com/cortexlabs/cortex/issues/571 (deliahu)
Direct users to check auto scaling group activity history if cluster up fails https://github.com/cortexlabs/cortex/pull/757 https://github.com/cortexlabs/cortex/issues/740 (vishalbollu)
Pre-install opencv system packages https://github.com/cortexlabs/cortex/pull/772 (vishalbollu)
Improve config validations https://github.com/cortexlabs/cortex/pull/751 https://github.com/cortexlabs/cortex/commit/c0a89a2224f98884cba2af53e8a108b7c32bf842 https://github.com/cortexlabs/cortex/issues/732 https://github.com/cortexlabs/cortex/issues/742 (deliahu, vishalbollu)
Increase metrics server memory request/limit https://github.com/cortexlabs/cortex/commit/60f00b079f7b87321f0c9d87454c02e6ec25e8e2#diff-d62cba9784a96fc0a7471ca4d8b38e96 https://github.com/cortexlabs/cortex/issues/748 (deliahu)
Disable operator autoscaling https://github.com/cortexlabs/cortex/pull/743 (vishalbollu)
Add kubectl top to cortex cluster info --debug output https://github.com/cortexlabs/cortex/pull/756 https://github.com/cortexlabs/cortex/issues/716 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex - v0.12.0

New features

Support new instance types (e.g. g3 and g4 instances) https://github.com/cortexlabs/cortex/pull/655 (deliahu)
Support batched TensorFlow and ONNX predictions https://github.com/cortexlabs/cortex/pull/666 https://github.com/cortexlabs/cortex/issues/562 (vishalbollu)
Allow users to configure availability zones https://github.com/cortexlabs/cortex/pull/681 https://github.com/cortexlabs/cortex/issues/677 (vishalbollu)
Support multiple cortex clusters in the same region https://github.com/cortexlabs/cortex/pull/661 https://github.com/cortexlabs/cortex/pull/664 https://github.com/cortexlabs/cortex/issues/660 (deliahu)
Add AWS resource pricing to cortex cluster up confirmation message https://github.com/cortexlabs/cortex/pull/647 https://github.com/cortexlabs/cortex/pull/690 https://github.com/cortexlabs/cortex/issues/641 (deliahu)
Autofill instance distribution based on spot price https://github.com/cortexlabs/cortex/pull/670 https://github.com/cortexlabs/cortex/issues/603 (vishalbollu)
Add support for passing environment variables through to containers https://github.com/cortexlabs/cortex/pull/694 https://github.com/cortexlabs/cortex/issues/688 (vishalbollu)

Bug fixes

Surface operator connection error messages in CLI https://github.com/cortexlabs/cortex/pull/659 https://github.com/cortexlabs/cortex/issues/658 (deliahu)
Fix occasional logs stream errors https://github.com/cortexlabs/cortex/pull/689 (vishalbollu)
Install pip packages with --no-cache-dir https://github.com/cortexlabs/cortex/pull/623 (vishalbollu)

Misc

Convert predictor APIs into Python classes https://github.com/cortexlabs/cortex/pull/636 https://github.com/cortexlabs/cortex/pull/666 https://github.com/cortexlabs/cortex/issues/589 (vishalbollu)
Rename sample to payload in Python APIs https://github.com/cortexlabs/cortex/pull/626 (vishalbollu)
Confirm before deleting a deployment https://github.com/cortexlabs/cortex/pull/692 https://github.com/cortexlabs/cortex/issues/674 (vishalbollu)
Check for unsupported instance types https://github.com/cortexlabs/cortex/commit/952a1f7af4a12c6560f2968961bfe4a7514c472f (deliahu)
Check user EC2 limits before spinning up instances https://github.com/cortexlabs/cortex/pull/638 https://github.com/cortexlabs/cortex/pull/653 https://github.com/cortexlabs/cortex/issues/584 (vishalbollu, deliahu)
Add cortex cluster info --debug command https://github.com/cortexlabs/cortex/pull/691 https://github.com/cortexlabs/cortex/issues/657 (deliahu)
Remove upper limit on CPU target utilization https://github.com/cortexlabs/cortex/pull/635 (deliahu)
Improve cortex deploy response message https://github.com/cortexlabs/cortex/pull/650 https://github.com/cortexlabs/cortex/issues/642 (deliahu)
Improve API status output https://github.com/cortexlabs/cortex/pull/656 https://github.com/cortexlabs/cortex/issues/652 (deliahu)
Improve spot config documentation https://github.com/cortexlabs/cortex/pull/670 https://github.com/cortexlabs/cortex/issues/627 (vishalbollu)
Pre-download Docker images on cluster installation https://github.com/cortexlabs/cortex/pull/662 https://github.com/cortexlabs/cortex/issues/569 (deliahu)
Remove cortex support command https://github.com/cortexlabs/cortex/pull/683 https://github.com/cortexlabs/cortex/issues/668 (vishalbollu)
Stream logs from all pods to CloudWatch https://github.com/cortexlabs/cortex/pull/671 https://github.com/cortexlabs/cortex/issues/586 (vishalbollu)
Support running cortex deploy from subdirectories https://github.com/cortexlabs/cortex/pull/675 https://github.com/cortexlabs/cortex/issues/673 (deliahu)
Set log group and bucket name defaults to cluster name https://github.com/cortexlabs/cortex/pull/693 https://github.com/cortexlabs/cortex/issues/680 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex - v0.11.1

Bug fixes

Update healthcheck to use exec instead of using http healthz https://github.com/cortexlabs/cortex/pull/648 (vishalbollu)

Misc

Allow configuring waitress parameters via API metadata https://github.com/cortexlabs/cortex/commit/5529873c05ba6cc28896d0b2167891794313f924 https://github.com/cortexlabs/cortex/commit/a354308512929584c90592170baf66583dd85bca (deliahu)

- Go
Published by deliahu over 6 years ago

cortex - v0.11.0

New features

Support spot instances https://github.com/cortexlabs/cortex/pull/585 https://github.com/cortexlabs/cortex/pull/597 https://github.com/cortexlabs/cortex/issues/469 (vishalbollu)

Examples

Add MLflow example https://github.com/cortexlabs/cortex/pull/566 https://github.com/cortexlabs/cortex/issues/553 (ospillinger)
Add language identification example (fastText) https://github.com/cortexlabs/cortex/commit/0173bc432a748c59ea6d9bd8419d58805ac81c56 (ospillinger)
Add answer generation example https://github.com/cortexlabs/cortex/pull/580 (ospillinger)
Add reading comprehension example https://github.com/cortexlabs/cortex/pull/581 (ospillinger)
Add text summarization example https://github.com/cortexlabs/cortex/commit/be42b7c1ec707b92e36dd95123615b17a9a26ff0 (ospillinger)

Misc

Create separate nodegroup for Cortex operator containers https://github.com/cortexlabs/cortex/pull/577 https://github.com/cortexlabs/cortex/issues/500 (vishalbollu)
Improve API logging https://github.com/cortexlabs/cortex/pull/596 https://github.com/cortexlabs/cortex/issues/587 (deliahu)
Improve CLI output https://github.com/cortexlabs/cortex/pull/570 https://github.com/cortexlabs/cortex/issues/567 https://github.com/cortexlabs/cortex/issues/568 https://github.com/cortexlabs/cortex/issues/574 (deliahu)
Update API info endpoint route and response https://github.com/cortexlabs/cortex/pull/594 https://github.com/cortexlabs/cortex/issues/593 (deliahu)
Add quickstart / tutorial https://github.com/cortexlabs/cortex/pull/595 (ospillinger)
Create CONTRIBUTING.md https://github.com/cortexlabs/cortex/pull/555 https://github.com/cortexlabs/cortex/issues/310 (ospillinger)

- Go
Published by vishalbollu over 6 years ago

cortex - v0.10.3

Misc:

Make instance volume size configurable https://github.com/cortexlabs/cortex/commit/f0adbb0c55085aefae760b78e4bc7df27f0cd16f (deliahu)

- Go
Published by deliahu over 6 years ago

cortex - v0.10.2

Bug fixes:

Query node group min and max size from AWS AutoScalingGroup API https://github.com/cortexlabs/cortex/pull/576 https://github.com/cortexlabs/cortex/issues/575 (deliahu)
Route API requests on exact URL matches, not prefix matches https://github.com/cortexlabs/cortex/commit/bb22971d79bad6f728b3a9716b511795208f65f9 (deliahu)
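
To show why exact matching matters, here is a small sketch contrasting the two strategies. It does not reproduce the cluster's actual routing layer; the route table, API names, and function names are hypothetical.

```python
def prefix_match(routes, path):
    """Return the first API whose route is a prefix of the request path."""
    for route, api in routes.items():
        if path.startswith(route):
            return api
    return None


def exact_match(routes, path):
    """Return the API registered for exactly this path, if any."""
    return routes.get(path)


# Hypothetical route table with two APIs whose endpoints share a prefix
routes = {"/iris": "iris-classifier", "/iris-v2": "iris-classifier-v2"}

print(prefix_match(routes, "/iris-v2"))  # resolves to the wrong API
print(exact_match(routes, "/iris-v2"))   # resolves to the intended API
```

With prefix matching, a request for /iris-v2 can be captured by the API registered at /iris. Exact matching avoids that ambiguity.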

- Go
Published by deliahu over 6 years ago

cortex - v0.10.1

Bug fixes:

Refresh logger after loading user modules https://github.com/cortexlabs/cortex/pull/563 (vishalbollu)
Remove extra parameters that may be sent to the JSON tricks encoder on initialization https://github.com/cortexlabs/cortex/pull/565 (vishalbollu)
Set all TensorFlow version directory names to "1" https://github.com/cortexlabs/cortex/pull/560 https://github.com/cortexlabs/cortex/issues/354 (deliahu)
Convert TensorFlow model prefix to a directory https://github.com/cortexlabs/cortex/commit/10b62b43edb436c3787bf0138203e04429494478 (deliahu)

Misc

Show CLI version regardless of cluster status https://github.com/cortexlabs/cortex/commit/0e218ab1e8cc9aaef1a95b9c30a0d09327151516 (deliahu)
Check EKS status before spinning up cluster https://github.com/cortexlabs/cortex/pull/561 (deliahu)
Add git to API images https://github.com/cortexlabs/cortex/pull/559 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex - v0.10.0

New features

Add Predictor model format to support deploying generic Python models https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/pull/549 https://github.com/cortexlabs/cortex/issues/506 (vishalbollu)
Ship with popular Python packages out of the box https://github.com/cortexlabs/cortex/pull/550 (vishalbollu)
Add metadata field to API configuration https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/pull/544 https://github.com/cortexlabs/cortex/issues/522 (vishalbollu)
Support custom API endpoints https://github.com/cortexlabs/cortex/pull/534 https://github.com/cortexlabs/cortex/issues/531 https://github.com/cortexlabs/cortex/issues/223 (deliahu)
Move cluster installation and management commands to the CLI https://github.com/cortexlabs/cortex/pull/527 https://github.com/cortexlabs/cortex/issues/501 https://github.com/cortexlabs/cortex/issues/499 https://github.com/cortexlabs/cortex/issues/515 https://github.com/cortexlabs/cortex/issues/518 https://github.com/cortexlabs/cortex/issues/526 (deliahu)

Examples

Add an example with Hugging Face's DistilGPT2 text generation model (PyTorch) https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/issues/507 (vishalbollu)
Add an example for predicting miles per gallon to demonstrate regression (scikit-learn) https://github.com/cortexlabs/cortex/pull/535 https://github.com/cortexlabs/cortex/issues/497 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex -

Bug fixes

Fix bug in multi-input ONNX models https://github.com/cortexlabs/cortex/commit/a6bdb5f74342c2bfb60bb2ca1d4cd6f05006853a (vishalbollu)
Don't update API metrics on non-POST requests https://github.com/cortexlabs/cortex/commit/f1bc223d7b6280b458b5296957d7dc0332155498 (deliahu)

Misc

Add CORS headers https://github.com/cortexlabs/cortex/pull/540 (deliahu)

- Go
Published by deliahu over 6 years ago

cortex -

Bug fixes:

Downgrade AWS CNI from v1.5.4 to v1.5.3 https://github.com/cortexlabs/cortex/commit/dba3ef774f9f189c49649c6d270adb830b63fded (deliahu)

- Go
Published by deliahu over 6 years ago

cortex -

New features

Add Cortex Python client https://github.com/cortexlabs/cortex/pull/488 https://github.com/cortexlabs/cortex/issues/467 (vishalbollu)
Add Cortex support CLI command https://github.com/cortexlabs/cortex/pull/491 https://github.com/cortexlabs/cortex/issues/336 (vishalbollu)
Add configure --print CLI command https://github.com/cortexlabs/cortex/commit/52ceae32b67d04d7ca6f6377c24d9aa5cdfa84db (deliahu)

Bug fixes:

Prevent load balancer from timing out requests https://github.com/cortexlabs/cortex/pull/490 https://github.com/cortexlabs/cortex/commit/adcf18cce3901f17b8bcd4a7b87e0c532a4744e5 https://github.com/cortexlabs/cortex/issues/487 (vishalbollu)
Remove unnecessary lock in operator init https://github.com/cortexlabs/cortex/commit/411bac6872b01306e3b9a5765db5edca5f3f0032 (deliahu)
Silence stale API saved status not found errors https://github.com/cortexlabs/cortex/commit/aeac492b8b01a73bf4c986f4ddffa88422a593df (deliahu)
Remove availability zone configuration https://github.com/cortexlabs/cortex/commit/2e8913ba5a21fa86494aaee8e45ae0ee1c976b08 https://github.com/cortexlabs/cortex/issues/494 (deliahu)
Show correct URL upon failed HTTP request from CLI https://github.com/cortexlabs/cortex/pull/504 (vishalbollu)

Examples

Shorten gpt-2 model output length https://github.com/cortexlabs/cortex/commit/1a56661bee2645ed12ffb9f232770c40c7ec8473 (deliahu)

Misc

Validate access to cortex bucket on deploy https://github.com/cortexlabs/cortex/pull/511 https://github.com/cortexlabs/cortex/issues/502 (deliahu)
Remove cortex namespace configuration option https://github.com/cortexlabs/cortex/commit/abdc0cd241d6a62b382039392afec5427ca77463 (deliahu)

- Go
Published by deliahu over 6 years ago

cortex -

New features

Add prediction response tracking https://github.com/cortexlabs/cortex/pull/322 https://github.com/cortexlabs/cortex/pull/360 https://github.com/cortexlabs/cortex/pull/378 https://github.com/cortexlabs/cortex/pull/419 https://github.com/cortexlabs/cortex/pull/481 https://github.com/cortexlabs/cortex/commit/81718b4c9a750e685bebb52e79204fea10933ab4 https://github.com/cortexlabs/cortex/issues/225 (vishalbollu)
Add networking metrics (latency, error codes) https://github.com/cortexlabs/cortex/pull/278 https://github.com/cortexlabs/cortex/pull/420 https://github.com/cortexlabs/cortex/pull/475 https://github.com/cortexlabs/cortex/pull/472 https://github.com/cortexlabs/cortex/issues/187 (vishalbollu, 1vn)
Support importing local python files in handlers https://github.com/cortexlabs/cortex/pull/398 https://github.com/cortexlabs/cortex/pull/452 (1vn, vishalbollu)
Support TensorFlow model directories on S3 https://github.com/cortexlabs/cortex/pull/323 https://github.com/cortexlabs/cortex/pull/373 https://github.com/cortexlabs/cortex/issues/215 https://github.com/cortexlabs/cortex/issues/366 (1vn)
Support user-specified TensorFlow signature def keys https://github.com/cortexlabs/cortex/pull/365 https://github.com/cortexlabs/cortex/pull/471 https://github.com/cortexlabs/cortex/pull/459 https://github.com/cortexlabs/cortex/issues/343 (1vn, vishalbollu, deliahu)
Improve signature def detection https://github.com/cortexlabs/cortex/pull/460 https://github.com/cortexlabs/cortex/commit/28dc9892f057a9141a8f6ae5385470ecbc4b4119 https://github.com/cortexlabs/cortex/issues/451 (vishalbollu)
Add debug mode to API requests https://github.com/cortexlabs/cortex/pull/369 https://github.com/cortexlabs/cortex/issues/328 (1vn)
Support print statements in handlers https://github.com/cortexlabs/cortex/pull/406 https://github.com/cortexlabs/cortex/pull/377 https://github.com/cortexlabs/cortex/issues/339 (vishalbollu, 1vn)
Automatically configure operator URL when installing Cortex https://github.com/cortexlabs/cortex/pull/401 https://github.com/cortexlabs/cortex/issues/334 (1vn)

Bug fixes

Evict pods that consume too much memory https://github.com/cortexlabs/cortex/pull/426 https://github.com/cortexlabs/cortex/issues/424 (deliahu)
Show logs from init containers https://github.com/cortexlabs/cortex/pull/393 https://github.com/cortexlabs/cortex/issues/324 (vishalbollu)
Support "None" dims in model signatures https://github.com/cortexlabs/cortex/pull/465 (deliahu)
Fix line wrapping with CLI --watch flag https://github.com/cortexlabs/cortex/commit/b4f72570108bd17a07ba7345c24b34a87ec11d4b (deliahu)

Examples

Convert example model code to notebooks https://github.com/cortexlabs/cortex/pull/480 (deliahu)
Add GPT-2 text generation example https://github.com/cortexlabs/cortex/pull/353 (1vn)
Add BERT sentiment analysis example https://github.com/cortexlabs/cortex/pull/295 (1vn)
Add AlexNet PyTorch example https://github.com/cortexlabs/cortex/pull/477 (vishalbollu)
Add ImageNet Inception example https://github.com/cortexlabs/cortex/pull/344 https://github.com/cortexlabs/cortex/issues/318 (1vn)
Add normalization to iris sklearn example https://github.com/cortexlabs/cortex/pull/337 (deliahu)

Misc

Remove sample key from prediction API and prediction key from prediction response https://github.com/cortexlabs/cortex/pull/399 https://github.com/cortexlabs/cortex/issues/389 (vishalbollu)
Remove response key from TensorFlow prediction response https://github.com/cortexlabs/cortex/pull/478 (vishalbollu)
Pass ONNX model output directly to post_inference request handler https://github.com/cortexlabs/cortex/pull/476 (vishalbollu)
Use HTTP endpoints by default https://github.com/cortexlabs/cortex/pull/350 https://github.com/cortexlabs/cortex/issues/327 (1vn)
Remove verbose flag from logs command https://github.com/cortexlabs/cortex/pull/400 https://github.com/cortexlabs/cortex/issues/391 (vishalbollu)
Add operator AWS credentials https://github.com/cortexlabs/cortex/pull/349 (ospillinger)
Replace non-ready APIs without rolling update https://github.com/cortexlabs/cortex/pull/448 https://github.com/cortexlabs/cortex/issues/407 (deliahu)
Autocast numpy objects to appropriate type https://github.com/cortexlabs/cortex/pull/384 https://github.com/cortexlabs/cortex/issues/338 (vishalbollu)
Add out-of-memory error https://github.com/cortexlabs/cortex/pull/418 https://github.com/cortexlabs/cortex/issues/372 (deliahu)
Add more fine-grained status messages https://github.com/cortexlabs/cortex/pull/440 https://github.com/cortexlabs/cortex/issues/408 (deliahu)
Don't require --force once min replicas are met https://github.com/cortexlabs/cortex/pull/449 https://github.com/cortexlabs/cortex/issues/359 (deliahu)
Validate that requested resources can fit in a node before deploying https://github.com/cortexlabs/cortex/pull/379 https://github.com/cortexlabs/cortex/issues/306 (1vn)
Validate that request handlers exist before deploying https://github.com/cortexlabs/cortex/pull/438 https://github.com/cortexlabs/cortex/issues/427 https://github.com/cortexlabs/cortex/issues/428 (vishalbollu)
Enforce zip file size limit https://github.com/cortexlabs/cortex/pull/457 https://github.com/cortexlabs/cortex/issues/437 (vishalbollu)
Support numeric CPU values in API configuration https://github.com/cortexlabs/cortex/pull/413 https://github.com/cortexlabs/cortex/issues/395 (deliahu)
Improve handler loading error messages https://github.com/cortexlabs/cortex/pull/382 https://github.com/cortexlabs/cortex/pull/352 https://github.com/cortexlabs/cortex/pull/479 https://github.com/cortexlabs/cortex/issues/292 https://github.com/cortexlabs/cortex/issues/341 (vishalbollu)
Improve cortex.sh configuration and logging https://github.com/cortexlabs/cortex/commit/d4e77383e5eb5485134b95d2da10bd45d44ecd77 https://github.com/cortexlabs/cortex/commit/577b31e5f13f370679673c4aa7e0ce2da1e58196 https://github.com/cortexlabs/cortex/commit/5a96fa706a93bcf5cbcd65092679f3fc23ec1949 https://github.com/cortexlabs/cortex/commit/08fde5b83e350934dadb9a8dbd6414c52f32a07a https://github.com/cortexlabs/cortex/commit/994a49bfedf677ad5842561ceb8e93882efba871 https://github.com/cortexlabs/cortex/commit/4c196a6e7bc86090d731d8380afc41f7ebac1ac5 (deliahu, ospillinger)
Add timestamp to logs https://github.com/cortexlabs/cortex/pull/402 https://github.com/cortexlabs/cortex/issues/390 (vishalbollu)
Only read deployment configuration from cortex.yaml https://github.com/cortexlabs/cortex/pull/396 https://github.com/cortexlabs/cortex/issues/387 (deliahu)
Rename default environment to "default" https://github.com/cortexlabs/cortex/commit/6051dcdeb4017c1e44bc6abbf2924c9ccf65f0a5 (deliahu)
Set Python version to 3.6 https://github.com/cortexlabs/cortex/pull/461 (deliahu)
Update TensorFlow version to 1.14 https://github.com/cortexlabs/cortex/commit/ba0b541fdfa1492215beea18209cf632df2c9234 (deliahu)
Use Istio for networking https://github.com/cortexlabs/cortex/pull/237 https://github.com/cortexlabs/cortex/pull/374 https://github.com/cortexlabs/cortex/issues/201 (1vn, deliahu)
Stream logs from CloudWatch https://github.com/cortexlabs/cortex/pull/447 https://github.com/cortexlabs/cortex/pull/466 (vishalbollu)

- Go
Published by deliahu over 6 years ago

cortex -

Bug fixes:

Update TensorFlow create_raw_prediction_request https://github.com/cortexlabs/cortex/commit/55f9916bc4caeb83ede891c0ee575be345910aac https://github.com/cortexlabs/cortex/commit/bcc54016867054dcf13fce6e19fed0f0b63c7590 (1vn)

Misc

CLI output redesign https://github.com/cortexlabs/cortex/pull/335 https://github.com/cortexlabs/cortex/issues/297 (vishalbollu)
Add upgrade docs https://github.com/cortexlabs/cortex/commit/97e92820af26d18be69e671599a61dd4d9cb69a4 (ospillinger)

- Go
Published by deliahu almost 7 years ago

cortex -

Bug fixes:

Allow unlimited request body size https://github.com/cortexlabs/cortex/commit/f846bd8356699770bb596bacdd5a41bc814ff917 (deliahu)
Increase TF Serving gRPC timeout https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (1vn)
Validate prepared sample https://github.com/cortexlabs/cortex/commit/e8df4f59565a92513290167d057551b86e4deff0 (1vn)
Add TensorFlow data types https://github.com/cortexlabs/cortex/commit/73471d50546d46efb660815ee98bd759e907b232 (deliahu)
Fix shape inference for TF serving https://github.com/cortexlabs/cortex/commit/ec3a798749d74ff5d94b39561a58042bb529d7ca (1vn)

Misc

Improve model schema printing on errors https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (vishalbollu)
Add API request logging https://github.com/cortexlabs/cortex/commit/483fb13c49eed6b4f35d3e35b0ff17dcf2ac3b40 (deliahu)
Use tensorflow/serving Docker images https://github.com/cortexlabs/cortex/commit/2e7a4ac0d5e974af21abcf934fbe9f36f4a7ac10 (deliahu)
Remove sample logging https://github.com/cortexlabs/cortex/commit/b7935246af64951f541971e779c96ad8ba80ea51 (1vn)

- Go
Published by deliahu almost 7 years ago

cortex -

Bug fixes:

Show TensorFlow Serving logs https://github.com/cortexlabs/cortex/pull/299 https://github.com/cortexlabs/cortex/issues/300 (vishalbollu)
Update AWS CNI version to 1.5.2 https://github.com/cortexlabs/cortex/pull/291 https://github.com/cortexlabs/cortex/issues/247 https://github.com/cortexlabs/cortex/issues/219 (deliahu)
Fix handler packages https://github.com/cortexlabs/cortex/pull/298 (vishalbollu)
Update pod status logic https://github.com/cortexlabs/cortex/commit/d9ced094145ab95959f4366c92be8761832c2250 https://github.com/cortexlabs/cortex/commit/af9059fea9287a16bc89bc9dc545f7a114bcb7b2 https://github.com/cortexlabs/cortex/commit/ee4b3fc69b04e2cac2554b500e66ac61e54337c0 (deliahu)
Add retries to metadata downloading https://github.com/cortexlabs/cortex/pull/286 https://github.com/cortexlabs/cortex/issues/45 (deliahu)

Misc

Update to TensorFlow 1.14.0 https://github.com/cortexlabs/cortex/commit/ba0b541fdfa1492215beea18209cf632df2c9234 https://github.com/cortexlabs/cortex/issues/302 (deliahu)

- Go
Published by deliahu almost 7 years ago

cortex -

New features:

Add GPU support for serving ONNX models #232 #233 #220 (vishalbollu)
Set model format based on path if not explicitly specified #251 #206 (vishalbollu)
Improve get command output for APIs #263 #177 #257 #256 (vishalbollu)
Aggregate API logs in cortex logs command #227 #214 (vishalbollu)
Aggregate API logs in CloudWatch #259 #226 (vishalbollu)
Add CLI command to list active Cortex deployments #268 #117 (vishalbollu)

Misc:

Improve API ready timestamp #244 (deliahu)
Prevent scaling immediately after API creation #255 #222 (deliahu)
Add Cortex Namespace to python modules #230 #205 (vishalbollu)
Allow ctrl+c to kill the manager process #252 #246 (vishalbollu)
Improve logging around request handlers #240 #207 #204 (vishalbollu)
Improve error message for prediction API mismatch #249 #176 (vishalbollu)
Make all logs one line #216 (vishalbollu)
Document how to add system packages to docker containers #250 #245 (vishalbollu)
Replace Argo with in-operator DAG manager #235 #218 (deliahu)
Improve uninstall process #711017f (deliahu)
Restructure iris example #270 #266 (vishalbollu)

Full Changelog

- Go
Published by deliahu almost 7 years ago

cortex -

New features:

Add cluster autoscaler #194 #189 (ospillinger)
Add pod autoscaler #196 #188 (deliahu)
Automate/improve installation process, add manager image #193 #192 (ospillinger)
Add support for serving ONNX models #182 #181 #164 (vishalbollu)
Support Python pre- and post-processing for inference #182 #178 (vishalbollu)

Misc:

Add Iris TensorFlow example #208 (1vn)
Add more fine grained API status #195 (deliahu)

Full Changelog

- Go
Published by deliahu almost 7 years ago

cortex -

New features:

Remove status command, fold into get and logs #171 #166 #165 (deliahu)
Remove region for external data fa227d1 #174 (deliahu)
Return expected input schema for prediction request errors febc293 (deliahu)

Bug fixes:

Show previous logs for failed API pods f620125 #179 (deliahu)
Fix external constants ec96d80 (deliahu)

Misc:

Hide end-to-end components if only using serving ff4910a b0666fb #180 #167 (deliahu)
Rename app to deployment #175 #180 (deliahu)
Rename app.yaml to cortex.yaml 355fdfc #168 (deliahu)
Improve get command resource printing 5e301c6 (deliahu)
Improve python error message if external data doesn't exist #183 (deliahu)
Remove init CLI command a044d81 #172 (deliahu)
Update to Go 1.12 #170 #169 (deliahu)

Full Changelog

- Go
Published by deliahu almost 7 years ago

cortex -

New features:

Support external constants #96 #163 (1vn)

Bug fixes:

Remove my_model.py from init CLI command fef5fae (deliahu)
Fix constant data status bug b7e0aa4 (deliahu)
Support resource references in model target_vocab 597d844 80e091e (deliahu)

- Go
Published by deliahu almost 7 years ago

cortex - v0.4.0

New features:

Input redesign #72 #154 (deliahu)
Add estimators #72 #154 (deliahu)
Support deploying external TensorFlow models #124 #154 (1vn)
Make raw columns optional #103 #111 (1vn)
Make aggregators and transformers optional #90 #100 (1vn)
Respond to prediction request with transformed columns #97 #153 (1vn)
Support bucket regions for data ingestion #115 #155 (vishalbollu)
Support not using an ingested column as a raw_column #69 #92 (vishalbollu)
Update to TensorFlow 1.13 #95 #116 (1vn)
Update to Spark 2.4.2 #87 (vishalbollu)
Validate app name does not have underscore #59 #112 (1vn)

Bug fixes:

Resolve Spark Context file added warnings #79 #137 (1vn)
Improve built-in index_string data format #68 #127 (1vn)
Address TF Serving gRPC Warning #61 #128 (1vn)
Ingestion of Parquet containing int or double columns throws validation errors #91 #92 (vishalbollu)
Update Argo version #74 #125 (1vn)
API is sometimes temporarily unavailable when updating #71 #85 (deliahu)
Resources not allocated to Spark workloads to generate training datasets #56 #86 (vishalbollu)

Misc:

Add development instructions #64 #102 (1vn)
Add recommendation example #58 #57 (vishalbollu)

Full Changelog

- Go
Published by deliahu almost 7 years ago

cortex - v0.3.0

docs.cortex.dev/v/0.3

Merged pull requests:

Rename transformed_column parameter in transform_spark() #49 (deliahu)
OOM (Out of memory) status #40 (1vn)
Change status to ingesting only after enough resources have been allocated #39 (vishalbollu)
Change default TensorFlow log level to DEBUG #37 (1vn)
Transformer model sentiment analysis example #36 (1vn)
Add integration test to spark workloads #35 (vishalbollu)
Tensor2Tensor Example and transform_tensorflow feature #29 (1vn)

Full Changelog

- Go
Published by deliahu about 7 years ago

cortex -

docs.cortex.dev/v/0.2

Merged pull requests:

Allow specifying ranges in cortex requirements.txt #32 (vishalbollu)
Prevent users from installing conflicting packages #30 (vishalbollu)
Add additional config path error wrapping and index to embeds #15 (deliahu)
Show config path in config errors #14 (1vn)
Add ability to sample dataset #12 (vishalbollu)
Expose additional csv parsing options #10 (vishalbollu)
GPU support #6 (1vn)
Bring your own package #5 (vishalbollu)

Full Changelog

- Go
Published by deliahu about 7 years ago

cortex -

docs.cortex.dev/v/0.1

- Go
Published by deliahu over 7 years ago
