- Rust 95.6%
- Python 2%
- Shell 2%
- C 0.1%
- Jinja 0.1%
|
|
||
|---|---|---|
| .cargo | ||
| .config | ||
| .devcontainer | ||
| .github | ||
| agents | ||
| benchmark | ||
| docker | ||
| licenses | ||
| scripts | ||
| src | ||
| tests | ||
| .editorconfig | ||
| .gitattributes | ||
| .gitignore | ||
| .typos.toml | ||
| .yamllint.yml | ||
| AGENTS.md | ||
| Cargo.lock | ||
| Cargo.toml | ||
| clippy.toml | ||
| LICENSE | ||
| licenserc-ee.toml | ||
| licenserc.toml | ||
| Makefile | ||
| pyproject.toml | ||
| README.md | ||
| ruff.toml | ||
| rust-toolchain.toml | ||
| rustfmt.toml | ||
| taplo.toml | ||
| uv.lock | ||
Databend
Enterprise Data Warehouse for AI Agents
Large-scale analytics, vector search, full-text search — with flexible agent orchestration and secure Python UDF sandboxes. Built for enterprise AI workloads.
💡 Why Databend?
Databend is an open-source enterprise data warehouse built in Rust.
Core capabilities: Analytics, vector search, full-text search, auto schema evolution — unified in one engine.
Agent-ready: Sandbox UDFs for agent logic, SQL for orchestration, transactions for reliability, branching for safe experimentation on production data.
| 📊 Core Engine Analytics, vector search, full-text search, auto schema evolution, transactions. |
🤖 Agent-Ready Sandbox UDF + SQL orchestration. Build and run agents on your enterprise data. |
| 🏢 Enterprise Scale Elastic compute, cloud native. S3/Azure/GCS. |
🌿 Branching Git-like data versioning. Agents safely operate on production snapshots. |
⚡ Quick Start
1. Cloud (Recommended)
Start for free on Databend Cloud — Production-ready in 60 seconds.
2. Local (Python)
Ideal for development and testing:
pip install databend
import databend
ctx = databend.SessionContext()
ctx.sql("SELECT 'Hello, Databend!'").show()
3. Docker
Run the full warehouse locally:
docker run -p 8000:8000 datafuselabs/databend
🤖 Agent-Ready Architecture
Databend's Sandbox UDF enables flexible agent orchestration with a three-layer architecture:
- Control Plane: Resource scheduling, permission validation, sandbox lifecycle management
- Execution Plane (Databend): SQL orchestration, issues requests via Arrow Flight
- Compute Plane (Sandbox Workers): Isolated sandboxes running your agent logic
-- Define your agent logic
CREATE FUNCTION my_agent(input STRING) RETURNS STRING
LANGUAGE python HANDLER = 'run'
AS $$
def run(input):
# Your agent logic: LLM calls, tool use, reasoning...
return response
$$;
-- Orchestrate agents with SQL
SELECT my_agent(question) FROM tasks;
🚀 Use Cases
- AI Agents: Sandbox UDF + SQL orchestration + branching for safe operations
- Analytics & BI: Large-scale SQL analytics — Learn more
- Search & RAG: Vector + full-text search — Learn more
🤝 Community & Support
Contributors are immortalized in the system.contributors table 🏆
📄 License
Apache 2.0 + Elastic 2.0 | Licensing FAQ