Milvus is an open-source project under the LF AI & Data Foundation[2] and is distributed under the Apache License 2.0.
History
Milvus has been developed by Zilliz since 2017.[3]
Milvus joined Linux Foundation as an incubation project in January 2020 and became a graduate in June 2021.[2] The details about its architecture and possible applications were presented at ACM SIGMOD Conference in 2021.[4]
Milvus 2.0, a major redesign of the whole product with a new architecture,[5] was released in January 2022.
Milvus can be deployed as an embedded database, standalone server, or distributed cluster. Zilliz Cloud offers a fully managed version.[16]
GPU support
Milvus provides GPU accelerated index building and search using NvidiaCUDA technology[17][18] via the Nvidia RAFT library,[19] including a recent GPU-based graph indexing algorithm known as CAGRA.[20]
Several storage providers have built integrations with Milvus to support AI workloads and large-scale vector search. These integrations aim to optimize performance, simplify inferencing workflows, and enhance data management capabilities:
^Subramanya, Suhas Jayaram; Kadekodi, Rohan; Krishaswamy, Ravishankar; Simhadri, Harsha Vardhan (8 December 2019). "DiskANN: fast accurate billion-point nearest neighbor search on a single node". Proceedings of the 33rd International Conference on Neural Information Processing Systems. Curran Associates Inc.: 13766–13776.
^Wang, Mengzhao; Xu, Weizhi; Yi, Xiaomeng; Wu, Songlin; Peng, Zhangyang; Ke, Xiangyu; Gao, Yunjun; Xu, Xiaoliang; Guo, Rentong; Xie, Charles (2024). "Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment". Proceedings of the ACM on Management of Data. 2: 1–27. arXiv:2401.02116. doi:10.1145/3639269.