Releases: dstackai/dstack
0.18.28
CLI improvements
- Added alias -R for --reuse with dstack apply
- Shortened model URL output
- dstack apply and dstack attach no longer rely on external tools such as ps and grep on Unix-like systems and powershell on Windows. With this change, it's now possible to use the dstack CLI client in minimal environments such as Docker containers, including the official dstackai/dstack image.
What's Changed
- Add DSTACK_{RUNNER,SHIM}_DOWNLOAD_URL env vars by @un-def in #2023
- [Feature] Add alias -R for --reuse with dstack apply by @peterschmidt85 in #2032
- Replace ps | grep with psutil in SSHAttach by @un-def in #2029
- Shorten model URL output in CLI by @jvstme in #2035
Full Changelog: 0.18.27...0.18.28
0.18.27
UI/UX improvements
This release fixes a login issue in the control plane UI and introduces other UI/UX improvements.
What's Changed
- Another batch of many minor improvements to the docs by @peterschmidt85 in #2016
- Show OpenAI-compatible endpoint URL in CLI by @jvstme in #2022
- [Bug]: Cannot open UI login screen by @olgenn in #2025
- [UI] Model page code snippets fixes and improvements by @olgenn in #2026
- [UI]: Fix curl sample in model code button by @jvstme in #2027
Full Changelog: 0.18.26...0.18.27
0.18.26
Git
Previously, when you called dstack init, Git credentials were reused between users of the same project and repository. Starting with this release, to improve security, dstack no longer shares Git credentials across users.
Warning
If you submitted credentials earlier with dstack init, they will continue to work. However, it is recommended that each user call dstack init again to ensure they do not reuse credentials from other users.
Deleting legacy credentials
To ensure that no credentials submitted earlier are shared across users, you can run the following SQL statement against the dstack server database:
UPDATE repos SET creds = NULL;
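As a minimal sketch, the statement above could be applied from Python, assuming the server stores its data in a SQLite file. The helper name clear_repo_creds and the database path in the usage note are hypothetical; only the repos table and creds column come from the statement itself. Back up the database before running anything like this.

```python
import sqlite3


def clear_repo_creds(db_path: str) -> int:
    """Run the statement from the release notes and return the number of rows updated.

    Assumption: db_path points at a SQLite database that has a `repos`
    table with a `creds` column, as the SQL statement above implies.
    """
    conn = sqlite3.connect(db_path)
    try:
        with conn:  # commits on success, rolls back if an error is raised
            cursor = conn.execute("UPDATE repos SET creds = NULL")
            return cursor.rowcount
    finally:
        conn.close()
```

For example, clear_repo_creds("/path/to/server/sqlite.db") would clear the legacy credentials in that file; the path here is a placeholder. If the server uses a different database, run the same statement with that database's own client instead.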
UI
This update brings a few UI improvements:
- Added a Delete button to the Volumes page
- Added a Refresh button to all pages with lists: Runs, Models, Fleets, Volumes, Projects
- Improved the Code button on the model page
What's changed
- Implement per-user repo creds storage by @un-def in #2004
- [UI] Add Refresh button to all pages with lists by @olgenn in #2007
- [UI] Include base URL and authentication token in the code snippets by @olgenn in #2006
- [UI] The Code button improvements on the Model page by @olgenn in #2001
- [UI] It's not possible to select and delete volumes by @olgenn in #2000
- [UI] [Bug]: Services without model mapping are displayed in Models UI by @olgenn in #1993
- Ensure sshd privsep dir in container is properly set up by @un-def in #2008
- [Docs] Many minor improvements to docs and examples by @peterschmidt85 in #2013
- [Docs] Services without a gateway by @jvstme in #2011
- [Docs] Add deployment section with vLLM, TGI and NIM. Remove alignment handbook by @Bihan in #1990
- [Docs] Updated Installation and Server deployment guides to include CloudFormation by @peterschmidt85
- [Docs] Update services docs to reflect that gateway is now optional by @peterschmidt85 in #2005
- [Examples] Add a CloudFormation template showing how to deploy dstack server to AWS by @peterschmidt85 in #1944
- [Examples] Add Airflow example by @r4victor in #1991
Full changelog: 0.18.25...0.18.26
0.18.25
Multiple volumes per mount point
It's now possible to specify a list of volumes for a mount point in run configurations:
...
volumes:
  - name: [my-aws-eu-west-1-volume, my-aws-us-east-1-volume]
    path: /volume_data
dstack will choose and mount one volume from the list. This can increase GPU availability: by listing volumes in different regions, a run can be provisioned wherever capacity exists, which works well for use cases like caching. Previously, it was possible to specify only one volume per mount point, so if there was no compute capacity in the volume's region, provisioning would fail.
DSTACK_NODES_IPS environment variable
A new DSTACK_NODES_IPS environment variable is now available for multi-node tasks. It contains a list of internal IP addresses of all nodes in the cluster, e.g. DSTACK_NODES_IPS="10.128.0.47\n10.128.0.48\n10.128.0.49". This feature enables cluster workloads that require configuring IP addresses of all the nodes.
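A task could read the variable along these lines. This is a sketch that assumes the addresses are separated by newline characters, as the example value suggests; the helper name parse_nodes_ips is hypothetical.

```python
import os


def parse_nodes_ips(raw: str) -> list[str]:
    """Split a newline-separated list of IP addresses, skipping blank lines."""
    return [line.strip() for line in raw.splitlines() if line.strip()]


if __name__ == "__main__":
    # Outside a multi-node task the variable is unset, so default to an empty string.
    nodes = parse_nodes_ips(os.environ.get("DSTACK_NODES_IPS", ""))
    for index, ip in enumerate(nodes):
        print(f"node {index}: {ip}")
```

The resulting list can then be passed to whatever the workload needs, for example a hosts file or a launcher's node list.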
What's Changed
- Adding an example of NIM by @deep-diver in #1853
- Support specifying multiple volumes per mount point by @r4victor in #1983
- Expose DSTACK_NODES_IPS env var by @r4victor in #1985
- Set minimum paramiko version to 3.2.0 by @un-def in #1984
- Limit azure-mgmt-network>=23.0.0,<28.0.0 by @r4victor in #1988
Full Changelog: 0.18.24...0.18.25
0.18.24
Backward compatibility
This update includes a hotfix for a backward compatibility issue that prevented CLI v0.18.23 from working with older versions of the dstack server.
What's changed
Full changelog: 0.18.23...0.18.24
0.18.23
Gateway is optional
Previously, running any service required setting up a gateway. With this update, a gateway is no longer needed to run a service for development purposes.
Service endpoint
- If no gateway is created, the service’s endpoint will be accessible at <dstack server URL>/proxy/services/<project name>/<run name>/.
- If a service has a model mapping, the model will be accessible via the OpenAI-compatible endpoint at <dstack server URL>/proxy/models/<project name>/.
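The two URL patterns above can be assembled with a small helper. This is only an illustration of the patterns as written in these notes; the function names and the example server URL, project name, and run name are hypothetical.

```python
def service_url(server_url: str, project: str, run_name: str) -> str:
    """Build the endpoint of a service that runs without a gateway."""
    return f"{server_url.rstrip('/')}/proxy/services/{project}/{run_name}/"


def models_url(server_url: str, project: str) -> str:
    """Build the OpenAI-compatible endpoint for a project's mapped models."""
    return f"{server_url.rstrip('/')}/proxy/models/{project}/"


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(service_url("http://localhost:3000", "main", "my-service"))
    print(models_url("http://localhost:3000", "main"))
```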
Note
While this change makes it much easier to use services for development, you will still need a gateway if you want to use a custom domain, enable HTTPS, or use auto-scaling.
Gateway property
If a gateway is created but isn’t needed for a service, set the gateway property to false. If you have multiple gateways, you can choose one by setting gateway to the name of the gateway.
Model mapping
If the model is in OpenAI format, you can now use a shorter syntax for model mapping: simply set the model property to the model's name.
type: service
image: ollama/ollama
commands:
  - ollama serve &
  - sleep 3
  - ollama pull llama3.1
  - fg
port: 11434
model: llama3.1
The longer syntax with more settings remains available.
Updating running services
Previously, updating a service’s configuration required restarting it. Now, you can update the replicas and scaling properties in place. Just run dstack apply, and the changes will take effect. New replicas will be created while the old ones continue running.
What's changed
- [dind] Update start-dockerd script by @un-def in #1928
- Add /proxy prefix to dstack-proxy endpoints by @jvstme in #1939
- [shim] Unmount volumes when run exits by @un-def in #1937
- Return error when instance added to multiple fleets(#1699) by @swsvc in #1938
- unify project administration by @olgenn in #1946
- [shim] Change NVIDIA GPU detection method by @un-def in #1945
- Support service scaling via in-place updates by @r4victor in #1958
- [Docs] Document resources.gpu.vendor property by @un-def in #1957
- Fix SSH fleet hosts validation by @un-def in #1955
- Support chat models in dstack-proxy by @jvstme in #1953
- Add user tag to AWS and GCP volumes by @james-boydell in #1948
- Fix dstack-proxy dependencies by @jvstme in #1959
- Support DSTACK_SERVER_ADMIN_TOKEN env by @r4victor in #1960
- Fix migration 82b32a135ea2 by @un-def in #1962
- Fix dstack apply runs with new names by @r4victor in #1964
- [Blog] Introducing instance volumes to persist data on instances by @peterschmidt85 in #1965
- [UI]: Support in-server model proxy by @olgenn in #1966
- Short model mapping syntax by @jvstme in #1967
- Fix VolumeModel.user not loaded for volume detach by @r4victor in #1970
- Drop the PROXY feature flag by @jvstme in #1971
- Allow specifying gateway in service configurations by @jvstme in #1972
- Improve error handling in model proxy by @jvstme in #1973
New contributors
- @james-boydell made their first contribution in #1948
Full changelog: 0.18.22...0.18.23
0.18.22
Custom OS images on AWS
You can now configure your own AMIs for the AWS backend.
projects:
- name: main
  backends:
  - type: aws
    creds:
      type: default
    os_images:
      cpu:
        name: my-cpu-ami
        user: admin
      nvidia:
        name: my-nvidia-ami
        user: ubuntu
This can be used as an alternative way to bring your software or data to the AWS instance and mount it into your runs using Instance volumes.
See the AWS backend reference for details on configuring OS images. Support for custom OS images in other backends is coming in future releases.
What's Changed
- [Blog] Docker and Docker Compose inside container by @peterschmidt85 in #1916
- [Examples] Update Chat UI compose.yaml by @un-def in #1919
- [Bug]: [UI] Dark YAML editor theme won't work bug ui by @olgenn in #1923
- Remove Cloud NAT check when provisioning by @r4victor in #1925
- Allow to customize AMIs used by AWS backend by @un-def in #1920
- Fix Azure hostname assignment by @r4victor in #1930
- Support GCP Shared VPC for some subnets by @r4victor in #1933
- Increase request body size limit for services by @jvstme in #1934
Full Changelog: 0.18.21...0.18.22
0.18.21
Instance volumes
In addition to network volumes, dstack now allows mounting instance (host) filesystems inside the run container. Because the contents of an instance volume are specific to the instance where the run is executed, such volumes suit cases where data persistence is not critical, for example, as a cache:
type: task
commands:
  - pip install -r requirements.txt
volumes:
  # reuse pip cache between runs
  - /dstack-cache/pip:/root/.cache/pip
See the instance volumes documentation for more information.
Azure custom and private networks
dstack now supports configuring custom Azure networks, which was previously possible only on AWS and GCP. In addition, you can now configure dstack to provision instances without public IPs on Azure to take advantage of private networks:
type: azure
tenant_id: my_tenant_id
subscription_id: my_subscription_id
regions: [westeurope]
public_ips: false
vpc_ids:
  westeurope: test-networks-rg/test-network
creds:
  type: default
Read more about Azure networking configuration in the docs.
Python 3.13 support for dstack package
The previous 0.18.20 release added support for Python 3.13 in run configurations. This release updates the dstack package itself so that it works under Python 3.13. The dstack package also drops support for Python 3.8, which has reached end of life. Note that python: 3.8 in run configurations is deprecated but still supported.
Multi-job UI
The control plane UI now displays detailed info on each job in the run, improving support for multi-node tasks and replicated services.
What's Changed
- Show all jobs in runs UI by @olgenn in #1887
- Revert the list of projects and users in Administration by @olgenn in #1888
- Fix instance price discrepancies in RunPod by @jvstme in #1891
- [Website] Change Backends to Partners by @peterschmidt85 in #1893
- Support auth: true services in dstack-proxy by @jvstme in #1885
- [Docs]: Fix typos by @dheerajsir in #1897
- Support custom and private networks for Azure by @r4victor in #1896
- Fix Dstack Server Deployment Link by @SagarSharma101 in #1898
- Add instance volumes by @un-def in #1895
- Add nat_check option to GCP config by @r4victor in #1904
- Handle deleted volumes in attach_volume() by @r4victor in #1907
- Fix log message on getting run volumes by @r4victor in #1909
- [dind] Improve start-dockerd script by @un-def in #1908
- Show warning on missing backend deps by @r4victor in #1911
- Support Python 3.13, drop Python 3.8 by @jvstme in #1910
- Update pydantic-duality to fix the infinite recursion bug by @zmievsa in #1902
- Set ping_interval on WebSocket connection by @r4victor in #1918
New Contributors
- @dheerajsir made their first contribution in #1897
- @SagarSharma101 made their first contribution in #1898
- @zmievsa made their first contribution in #1902
Full Changelog: 0.18.20...0.18.21
0.18.20
Python 3.13 support
Following the release of Python 3.13 on October 7, 2024, dstack now supports python: 3.13 in run configurations. python: 3.8 is still supported but deprecated.
Note: the dstack package itself does not yet work on Python 3.13 due to some limitations in dependencies. We're looking into supporting it as well.
Custom backend tags
You can now define custom tags that dstack will assign to all cloud resources it creates, including instances and volumes. The tags are defined in the backend configuration:
type: aws
tags:
  company_department: finance
  company_project: dstack
  company_user: victor
creds:
  type: default
Custom tags are supported for AWS, Azure, and GCP.
Improved support of AWS private subnets
Previously, when configuring an AWS backend to use private subnets (public_ips: false), dstack would require a NAT Gateway. Now dstack supports more networking setups that provide outbound internet access, including NAT Gateway, Transit Gateway, and VPC Peering Connection.
New required permissions
dstack now sets labels on GCP volumes, which requires the compute.disks.setLabels permission.
Deprecations
python: 3.8 in run configurations is deprecated.
What's Changed
- Add created_at to projects and users by @r4victor in #1857
- Improvements for model details page in the UI by @olgenn in #1860
- [Bug]: Users logged out after rotating their tokens without seeing tokens by @olgenn in #1861
- [dind] Move dind processes to a separate cgroup by @un-def in #1859
- [Docs] Add Docker protip and Docker Compose example by @un-def in #1858
- Implement custom backend tags by @r4victor in #1872
- [shim] Remove anonymous volumes along associated container by @un-def in #1873
- Allow running services without a gateway by @jvstme in #1869
- [UX]: Resize chat input field based on content #1562 by @olgenn in #1875
- Collect AMD GPU metrics by @r4victor in #1877
- [Blog] Monitoring GPU usage and other container metrics by @peterschmidt85 in #1874
- [Docs] Rename HUGGING_FACE_HUB_TOKEN to HF_TOKEN by @peterschmidt85 in #1871
- Support Python 3.13 and deprecate 3.8 in run configurations by @jvstme in #1878
- Support AWS private subnets with Transit Gateway by @r4victor in #1881
- Fix collecting metrics from CPU instances by @r4victor in #1882
Full Changelog: 0.18.19...0.18.20
0.18.19
This release contains CLI hotfixes for 0.18.18, including a fix for client backward compatibility and a fix for reported memory usage in dstack stats. It's recommended to update the CLI from 0.18.18 to 0.18.19. Updating the server from 0.18.18 is not necessary.
What's Changed
- Fix Trl example by @Bihan in #1851
- Fix dstack client 0.18.18 compatibility with older servers by @un-def in #1850
- [Docs]: Fix frontend build instructions by @jvstme in #1849
- Show working set memory in dstack stats by @r4victor in #1856
Full Changelog: 0.18.18...0.18.19