Cosdata Roadmap
Detailed timeline and feature status for our journey from MVP to production-ready enterprise solution
Navigation
Timeline
Feature Status
CompletedFeatures we've finished
In ProgressCurrently working on
TodoComing next
Development Timeline
December 15, 2024
MVP/Alpha: Vector Database
- Optimized HNSW (dense) and Inverted index (sparse) implementations
- RESTful API for core operations
- SIMD optimized major distance metrics and quantization
- Versioning & "Transaction-as-a-resource"
- Run basic comparison benchmarks for HNSW & Inverted index (SPLADE)
January 30, 2025
MVP/Alpha: Graph Database
- Basic graph data structures and operations
- Simple integration with vector search
- Rudimentary CosQL for graph queries
March 15, 2025
Beta: Graph Database
- Advanced graph algorithms and knowledge graph features
- Enhanced CosQL with graph-specific operations
- Basic rule evaluation engine
June 15, 2025
RC/GA: Graph Database
- Full graph database capabilities
- Seamless integration of graph and vector search
- Advanced knowledge graph operations and querying
June 15, 2025
Beta: Cloud Services
- Multi-cloud support and improved resource management
- Enhanced monitoring and basic serverless functions
- Development of comprehensive web application
- Initial integration with major cloud ecosystems
August 30, 2025
RC/GA: Cloud Services
- Fully automated deployment and scaling
- Production-ready web application with full feature set
- Comprehensive management and analytics interface
- High availability and redundancy features
- Complete integration with major cloud ecosystems
Feature Status
Indexing and Search
Feature | Status | Phase |
---|---|---|
HNSW indexing for dense vectors with high dimensionality support | Completed | MVP/ALPHA |
Inverted Index for sparse vectors (Splade & BM25), supporting very high dimensionality | Completed | MVP/ALPHA |
ANN probabilistic search for Inverted Index | Completed | MVP/ALPHA |
Benchmarking Inverted Index against proprietary data type offerings | In Progress | BETA |
Optimized hybrid search algorithms | Todo | MVP/ALPHA |
Advanced indexing optimizations | Todo | BETA |
Complete end-to-end comparison benchmarking of HNSW & Inverted Index | Todo | BETA |
Implement re-ranker integration | Todo | RC/GA |
Distance Metrics and Quantization
Feature | Status | Phase |
---|---|---|
Dot product | Completed | MVP/ALPHA |
Cosine Similarity | Completed | MVP/ALPHA |
Euclidean | Completed | MVP/ALPHA |
Hamming | Todo | MVP/ALPHA |
SIMD optimizations for cosine & dot product metrics | Completed | MVP/ALPHA |
Binary (base 2) quantization | Completed | MVP/ALPHA |
Quaternary (base 4) quantization | Completed | MVP/ALPHA |
Octal (base 8) quantization | Completed | MVP/ALPHA |
U8 (base 256) quantization | Completed | MVP/ALPHA |
Sub-Byte Quantization of Inverted Index | In Progress | BETA |
SIMD optimizations for all quantization methods | In Progress | RC/GA |
Implementing auto-configuration for optimal quantization and storage based on statistical sampling | In Progress | BETA |
Storage and Performance
Feature | Status | Phase |
---|---|---|
Buffered IO, equivalent to memory mapped files for efficient caching | Completed | MVP/ALPHA |
Custom storage layer with serialization of index and corresponding file formats | Completed | MVP/ALPHA |
Lazy Loading of index nodes, fulfilling DiskANN requirements for low memory use | Completed | MVP/ALPHA |
LRU cache for lazy loaded items | Completed | MVP/ALPHA |
Separation of compute & storage architecture | Completed | MVP/ALPHA |
Advanced caching strategies | Todo | BETA |
Distributed storage support | Todo | RC/GA |
Implement advanced sharding for multi-billion scale datasets | Todo | RC/GA |
Enhance high availability and redundancy features | Todo | RC/GA |
Data Management and Versioning
Feature | Status | Phase |
---|---|---|
Versioning with transaction-based historical revisions and branching | Completed | MVP/ALPHA |
Lazy loadable collections (Set, Map, Vec, Array, EagerLazyLoad, etc) | Completed | MVP/ALPHA |
Auto creation of indexes | Completed | MVP/ALPHA |
Advanced versioning features, like branching & related APIs | Todo | BETA |
Improve usability of versioning system | Todo | BETA |
Multi-modal data support | Todo | RC/GA |
Add native support for storing documents and multi-modal data types | Todo | RC/GA |
Query and API
Feature | Status | Phase |
---|---|---|
RESTful API (upsert, ANN, collection create, create index) | Completed | MVP/ALPHA |
Developing user-facing RESTful API for Inverted Index | In Progress | BETA |
Integrating HNSW hyperparameters API | In Progress | BETA |
GraphQL API support | Todo | RC/GA |
Implement metadata filtering | Todo | BETA |
Graph Database and Knowledge Graph
Feature | Status | Phase |
---|---|---|
Cos Query Language (CosQL) specification | Completed | MVP/ALPHA |
Rule, Fact, Schema parser for data definition, manipulation & querying | Completed | MVP/ALPHA |
Rule evaluation engine (detailed design document created) | Completed | MVP/ALPHA |
Enhanced CosQL features | Todo | BETA |
Enhance graph database rule evaluation engine and improve performance | Todo | BETA |
Integrate LLM/model for natural language querying of knowledge graphs and relational data | Todo | RC/GA |
Implement Agentic Memory capabilities | Todo | RC/GA |
Cloud Integration and Web Application
Feature | Status | Phase |
---|---|---|
Prototype web-based management interface | Todo | MVP/ALPHA |
Begin development of comprehensive web application | Todo | BETA |
Implement basic serverless functions | Todo | BETA |
Integrate with major cloud ecosystems (initial) | Todo | BETA |
Release production-ready web application | Todo | RC/GA |
Implement advanced serverless functions | Todo | RC/GA |
Fully integrate with major cloud ecosystems | Todo | RC/GA |
Develop comprehensive analytics features in web application | Todo | RC/GA |
Integration and Ecosystem
Feature | Status | Phase |
---|---|---|
Integrate with major text and image vectorization models | Todo | RC/GA |
Integrate with LangChain, LlamaIndex, and similar frameworks | Todo | RC/GA |
Develop web application and cloud serverless integration with major ecosystems | Todo | RC/GA |
Security and Access Control
Feature | Status | Phase |
---|---|---|
Develop authentication and IAM user roles for filtering/joining HNSW and Inverted indexes | Todo | RC/GA |
Ongoing Improvements
Feature | Status | Phase |
---|---|---|
Ongoing bug fixes and performance improvements | In Progress | ALL PHASES |