Tools & Products
At Query.Farm, we're building powerful tools and services that extend the capabilities of DuckDB, making it faster, more flexible, and easier to integrate into production systems. Our extensions help teams process terabytes of data 10x faster, build real-time analytics pipelines, and solve complex data challenges with SQL.
We create open source extensions, developer utilities, and commercial software for teams working with embedded analytics.
🛠️ Our Products
Our DuckDB extensions have been downloaded 2,000,000+ times by data teams worldwide.
Data Engineering & ETL
Airport: Connect DuckDB to remote data sources via Apache Arrow Flight for high-performance data access and sharing. Airport allows efficient querying of remote data in any data format.
ShellFS: Stream data in and out of DuckDB using shell commands, ideal for ETL, automation, and UNIX-style workflows.
Streaming & Real-Time Analytics
Tributary: Integrate DuckDB with Apache Kafka for real-time querying and analysis of streaming data.
Radio: Enable DuckDB to send/receive events via WebSockets and Redis Pub/Sub.
Probabilistic & Approximate Analytics
Bitfilters: Fast, space-efficient set membership and duplicate detection using advanced probabilistic filters.
Datasketches: Scalable approximate analytics (distinct counts, quantiles, set operations) with Apache DataSketches.
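To make "probabilistic set membership" concrete, here is a minimal Python sketch of a Bloom filter, one common kind of probabilistic filter. It is an illustration of the general idea only, not the Bitfilters implementation or its SQL API; the class name, sizes, and hashing scheme are assumptions chosen for readability.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: may report false positives, never false negatives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, item):
        # Derive one bit position per hash by salting SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        # True means "possibly seen"; False means "definitely not seen".
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


# Hypothetical order IDs, used only to exercise the filter.
seen = BloomFilter()
for order_id in ["a-100", "a-101", "a-102"]:
    seen.add(order_id)
```

A duplicate check then becomes `seen.might_contain(order_id)`: a `False` answer is always correct, while a `True` answer may occasionally be a false positive, which is the trade-off that keeps the structure small.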
Search, Matching & Completion
Fuzzycomplete: Fuzzy SQL completion for intuitive, context-aware suggestions.
Rapidfuzz: High-performance fuzzy string matching for deduplication, search, and data cleaning.
Marisa: Fast string lookups and prefix searches using MARISA tries.
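As a rough picture of what fuzzy string matching measures, the Python sketch below computes Levenshtein edit distance and turns it into a 0 to 1 similarity score. This is a generic textbook approach under our own assumptions, not the Rapidfuzz extension's functions or scoring; the function names are hypothetical.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, or substitutions."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Normalize edit distance into a score where 1.0 means identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest
```

For deduplication, pairs of values whose score exceeds a chosen threshold would be treated as candidate matches for review.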
Hashing & Security
Statistical Analysis
Stochastic: Comprehensive statistical distribution functions for probability, sampling, and analytics.
Visualization
Textplot: Text-based data visualization, including bar charts, density plots, and more, directly in SQL.
Advanced SQL Logic & Spatial Data
EvalExpr_Rhai: Inline scripting with Rhai for custom logic and dynamic calculations in SQL.
Lindel: Spatial indexing and multi-dimensional data linearization using Hilbert and Morton/Z-Order curves.
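For readers new to data linearization, this Python sketch shows the idea behind a Morton/Z-Order curve in two dimensions: the bits of the two coordinates are interleaved into one sortable key. It is a conceptual illustration, not Lindel's implementation or function signatures; the function names and the 16-bit default are assumptions.

```python
def morton_encode_2d(x, y, bits=16):
    """Interleave the low `bits` bits of x and y into a single Z-order key."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)      # x bits land on even positions
        key |= ((y >> i) & 1) << (2 * i + 1)  # y bits land on odd positions
    return key


def morton_decode_2d(key, bits=16):
    """Recover (x, y) by splitting the even and odd bits of the key."""
    x = y = 0
    for i in range(bits):
        x |= ((key >> (2 * i)) & 1) << i
        y |= ((key >> (2 * i + 1)) & 1) << i
    return x, y
```

Sorting rows by such a key tends to place points that are close in two dimensions near each other in storage, which is what makes range scans over spatial data cheaper.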
DuckDB Extension Use Cases
Some of the ways that our products have been used are:
Real-time Analytics Pipeline: Combine Tributary (Kafka integration) + Radio (WebSocket events) + Bitfilters (duplicate detection) for live data processing.
Data Quality Workflow: Use Rapidfuzz (fuzzy matching) + Hashfuncs (fingerprinting) + Stochastic (statistical validation) to clean and validate datasets.
High-Performance ETL: Leverage Airport (Arrow Flight) + ShellFS (shell integration) + Crypto (data integrity) for secure, fast data movement.
🦆 Custom Products
Need something specific?
We build custom DuckDB extensions and tools tailored to your needs. Whether it's a file format, a data source, or a connector, we can help.
💬 [email protected]
Be the First to Know at Query.Farm
Get exclusive access to new features, SQL tricks, and exciting announcements delivered right to your inbox. We only send emails when there's something awesome.