Cosdata Roadmap

Detailed timeline and feature status for our journey from MVP to a production-ready enterprise solution

Completed: Features we've finished

In Progress: Currently working on

Todo: Coming next

Development Timeline

December 15, 2024

MVP/Alpha: Vector Database

Optimized HNSW (dense) and Inverted index (sparse) implementations
RESTful API for core operations
SIMD optimized major distance metrics and quantization
Versioning & "Transaction-as-a-resource"
Basic comparison benchmarks for HNSW and Inverted index (SPLADE)

January 30, 2025

MVP/Alpha: Graph Database

Basic graph data structures and operations
Simple integration with vector search
Rudimentary CosQL for graph queries

March 15, 2025

Beta: Graph Database

Advanced graph algorithms and knowledge graph features
Enhanced CosQL with graph-specific operations
Basic rule evaluation engine

June 15, 2025

RC/GA: Graph Database

Full graph database capabilities
Seamless integration of graph and vector search
Advanced knowledge graph operations and querying

June 15, 2025

Beta: Cloud Services

Multi-cloud support and improved resource management
Enhanced monitoring and basic serverless functions
Development of comprehensive web application
Initial integration with major cloud ecosystems

August 30, 2025

RC/GA: Cloud Services

Fully automated deployment and scaling
Production-ready web application with full feature set
Comprehensive management and analytics interface
High availability and redundancy features
Complete integration with major cloud ecosystems

Feature Status

Indexing and Search

Feature	Status	Phase
HNSW indexing for dense vectors with high dimensionality support	Completed	MVP/ALPHA
Inverted Index for sparse vectors (SPLADE & BM25), supporting very high dimensionality	Completed	MVP/ALPHA
ANN probabilistic search for Inverted Index	Completed	MVP/ALPHA
Benchmarking Inverted Index against proprietary data type offerings	In Progress	BETA
Optimized hybrid search algorithms	Todo	MVP/ALPHA
Advanced indexing optimizations	Todo	BETA
Complete end-to-end comparison benchmarking of HNSW & Inverted Index	Todo	BETA
Implement re-ranker integration	Todo	RC/GA
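
To illustrate the inverted-index idea behind the sparse-vector rows above, here is a minimal sketch in Python. It is not Cosdata's implementation: the class name `SparseInvertedIndex`, its methods, and the dictionary-based vector format are hypothetical, and the scoring is an exact dot product rather than the ANN probabilistic search listed in the table.

```python
from collections import defaultdict


class SparseInvertedIndex:
    """Toy inverted index over sparse vectors (hypothetical, for illustration only)."""

    def __init__(self):
        # dimension id -> list of (doc_id, weight) postings
        self.postings = defaultdict(list)

    def add(self, doc_id, vector):
        """Index a sparse vector given as {dimension: weight}."""
        for dim, weight in vector.items():
            if weight != 0.0:
                self.postings[dim].append((doc_id, weight))

    def search(self, query, top_k=10):
        """Score documents by exact dot product with the sparse query."""
        scores = defaultdict(float)
        for dim, q_weight in query.items():
            # Only dimensions present in the query are visited, so the cost
            # depends on posting-list lengths, not on total dimensionality.
            for doc_id, weight in self.postings.get(dim, ()):
                scores[doc_id] += q_weight * weight
        # Highest score first; ties broken by doc_id for determinism.
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
```

Because only the non-zero dimensions are stored and visited, this layout suits SPLADE- or BM25-style vectors, where the vocabulary is very large but each vector has few non-zero entries.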

Distance Metrics and Quantization

Feature	Status	Phase
Dot product	Completed	MVP/ALPHA
Cosine Similarity	Completed	MVP/ALPHA
Euclidean	Completed	MVP/ALPHA
Hamming	Todo	MVP/ALPHA
SIMD optimizations for cosine & dot product metrics	Completed	MVP/ALPHA
Binary (base 2) quantization	Completed	MVP/ALPHA
Quaternary (base 4) quantization	Completed	MVP/ALPHA
Octal (base 8) quantization	Completed	MVP/ALPHA
U8 (base 256) quantization	Completed	MVP/ALPHA
Sub-Byte Quantization of Inverted Index	In Progress	BETA
SIMD optimizations for all quantization methods	In Progress	RC/GA
Implementing auto-configuration for optimal quantization and storage based on statistical sampling	In Progress	BETA
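
The metrics and quantization levels listed above can be sketched in a few lines of plain Python. This is a reference sketch under stated assumptions, not Cosdata's SIMD code: the function names are hypothetical, values are assumed to lie in [-1, 1], and "base N" quantization is interpreted here as uniform scalar quantization into N levels per component (2, 4, 8, or 256).

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Cosine of the angle between a and b; 0.0 if either vector is all zeros."""
    norm = math.sqrt(dot(a, a)) * math.sqrt(dot(b, b))
    return dot(a, b) / norm if norm else 0.0


def euclidean(a, b):
    """Euclidean (L2) distance between a and b."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def hamming(a, b):
    """Number of positions at which two equal-length code sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def quantize(vec, levels, lo=-1.0, hi=1.0):
    """Uniform scalar quantization (assumed scheme).

    Each value is clamped to [lo, hi] and mapped to an integer code
    in [0, levels - 1].
    """
    step = (hi - lo) / levels
    codes = []
    for x in vec:
        clamped = min(max(x, lo), hi)
        codes.append(min(int((clamped - lo) / step), levels - 1))
    return codes
```

With `levels=2` each component needs one bit, with `levels=4` two bits, with `levels=8` three bits, and with `levels=256` one byte, which is where the storage savings over 32-bit floats come from.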

Storage and Performance

Feature	Status	Phase
Buffered IO, equivalent to memory-mapped files for efficient caching	Completed	MVP/ALPHA
Custom storage layer with serialization of index and corresponding file formats	Completed	MVP/ALPHA
Lazy Loading of index nodes, fulfilling DiskANN requirements for low memory use	Completed	MVP/ALPHA
LRU cache for lazy-loaded items	Completed	MVP/ALPHA
Separation of compute & storage architecture	Completed	MVP/ALPHA
Advanced caching strategies	Todo	BETA
Distributed storage support	Todo	RC/GA
Implement advanced sharding for multi-billion scale datasets	Todo	RC/GA
Enhance high availability and redundancy features	Todo	RC/GA
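
As a rough illustration of how lazy loading and an LRU cache fit together, the sketch below loads an item only on first access and evicts the least recently used item once a capacity limit is reached. The class name `LazyLRUCache` and the `loader` callback are hypothetical and stand in for whatever reads an index node from disk. Cosdata's actual cache design is not described in this roadmap.

```python
from collections import OrderedDict


class LazyLRUCache:
    """Load items on demand and keep only the most recently used ones."""

    def __init__(self, capacity, loader):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.loader = loader          # hypothetical: fetches an item by key
        self._items = OrderedDict()   # ordered from least to most recently used

    def get(self, key):
        if key in self._items:
            # Cache hit: mark the item as most recently used.
            self._items.move_to_end(key)
            return self._items[key]
        # Cache miss: load lazily, then evict the oldest entry if over capacity.
        value = self.loader(key)
        self._items[key] = value
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)
        return value
```

Memory use is bounded by `capacity` regardless of index size, at the cost of reloading nodes that were evicted and are requested again.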

Data Management and Versioning

Feature	Status	Phase
Versioning with transaction-based historical revisions and branching	Completed	MVP/ALPHA
Lazy-loadable collections (Set, Map, Vec, Array, EagerLazyLoad, etc.)	Completed	MVP/ALPHA
Auto creation of indexes	Completed	MVP/ALPHA
Advanced versioning features, like branching & related APIs	Todo	BETA
Improve usability of versioning system	Todo	BETA
Multi-modal data support	Todo	RC/GA
Add native support for storing documents and multi-modal data types	Todo	RC/GA

Query and API

Feature	Status	Phase
RESTful API (upsert, ANN, collection create, create index)	Completed	MVP/ALPHA
Developing user-facing RESTful API for Inverted Index	In Progress	BETA
Integrating HNSW hyperparameters API	In Progress	BETA
GraphQL API support	Todo	RC/GA
Implement metadata filtering	Todo	BETA

Graph Database and Knowledge Graph

Feature	Status	Phase
Cos Query Language (CosQL) specification	Completed	MVP/ALPHA
Rule, Fact, Schema parser for data definition, manipulation & querying	Completed	MVP/ALPHA
Rule evaluation engine (detailed design document created)	Completed	MVP/ALPHA
Enhanced CosQL features	Todo	BETA
Enhance graph database rule evaluation engine and improve performance	Todo	BETA
Integrate LLM/model for natural language querying of knowledge graphs and relational data	Todo	RC/GA
Implement Agentic Memory capabilities	Todo	RC/GA

Cloud Integration and Web Application

Feature	Status	Phase
Prototype web-based management interface	Todo	MVP/ALPHA
Begin development of comprehensive web application	Todo	BETA
Implement basic serverless functions	Todo	BETA
Integrate with major cloud ecosystems (initial)	Todo	BETA
Release production-ready web application	Todo	RC/GA
Implement advanced serverless functions	Todo	RC/GA
Fully integrate with major cloud ecosystems	Todo	RC/GA
Develop comprehensive analytics features in web application	Todo	RC/GA

Integration and Ecosystem

Feature	Status	Phase
Integrate with major text and image vectorization models	Todo	RC/GA
Integrate with LangChain, LlamaIndex, and similar frameworks	Todo	RC/GA
Develop web application and cloud serverless integration with major ecosystems	Todo	RC/GA

Security and Access Control

Feature	Status	Phase
Develop authentication and IAM user roles for filtering/joining HNSW and Inverted indexes	Todo	RC/GA

Ongoing Improvements

Feature	Status	Phase
Ongoing bug fixes and performance improvements	In Progress	ALL PHASES