Tools & Products
At Query.Farm, we're building powerful tools and services that extend the capabilities of DuckDB, making it faster, more flexible, and easier to integrate into production systems. Our extensions help teams process terabytes of data 10x faster, build real-time analytics pipelines, and solve complex data challenges with SQL.
We create open source extensions, developer utilities, and commercial software for teams working with embedded analytics.
🛠️ Our Products
Our DuckDB extensions have been downloaded 2,000,000+ times by data teams worldwide.
Data Engineering & ETL
Airport: Connect DuckDB to remote data sources via Apache Arrow Flight for high-performance data access and sharing. Airport allows efficient querying of remote data in any data format.
ShellFS: Stream data in/out of DuckDB using shell commands, perfect for ETL, automation, and UNIX-style workflows.
Streaming & Real-Time Analytics
Tributary: Integrate DuckDB with Apache Kafka for real-time querying and analysis of streaming data.
Radio: Enable DuckDB to send/receive events via WebSockets and Redis Pub/Sub.
Probabilistic & Approximate Analytics
Bitfilters: Fast, space-efficient set membership and duplicate detection using advanced probabilistic filters.
Datasketches: Scalable approximate analytics (distinct counts, quantiles, set operations) with Apache DataSketches.
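To make "probabilistic filter" concrete, here is a minimal Python sketch of a Bloom filter, one common structure of this kind, used for duplicate detection. It is only an illustration of the general idea: the class and function names (`BloomFilter`, `find_duplicates`) and the sizing defaults are assumptions made for this example and are not the Bitfilters API or its implementation.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: never misses an added item, may rarely report a false positive."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        # Illustrative defaults; real deployments size these from the expected
        # item count and the acceptable false-positive rate.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item: str):
        # Derive several bit positions by salting one hash function with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item: str) -> bool:
        # True means "probably seen"; False means "definitely not seen".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


def find_duplicates(items):
    """Return items flagged as probable repeats (rare false positives are possible)."""
    seen = BloomFilter()
    duplicates = []
    for item in items:
        if seen.might_contain(item):
            duplicates.append(item)
        else:
            seen.add(item)
    return duplicates
```

The trade-off is a fixed, small memory footprint in exchange for an occasional false "already seen" answer, which is usually acceptable for deduplication at scale.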
Search, Matching & Completion
Fuzzycomplete: Fuzzy SQL completion for intuitive, context-aware suggestions.
Rapidfuzz: High-performance fuzzy string matching for deduplication, search, and data cleaning.
Marisa: Fast string lookups and prefix searches using MARISA tries.
Hashing & Security
Statistical Analysis
Stochastic: Comprehensive statistical distribution functions for probability, sampling, and analytics.
Visualization
Textplot: Text-based data visualization, including bar charts, density plots, and more, directly in SQL.
Advanced SQL Logic & Spatial Data
EvalExpr_Rhai: Inline scripting with Rhai for custom logic and dynamic calculations in SQL.
Lindel: Spatial indexing and multi-dimensional data linearization using Hilbert and Morton/Z-Order curves.
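As a rough illustration of what "linearization" with a Morton/Z-Order curve means, the Python sketch below interleaves the bits of two non-negative integer coordinates into a single sortable key and reverses the process. The function names and the 16-bit default are assumptions made for this example; this is not Lindel's API, and Hilbert encoding (which preserves locality better) is not shown.

```python
def interleave_bits(value: int, bits: int = 16) -> int:
    """Spread the low `bits` bits of `value` so each lands on an even bit position."""
    result = 0
    for i in range(bits):
        result |= ((value >> i) & 1) << (2 * i)
    return result


def morton_encode(x: int, y: int, bits: int = 16) -> int:
    """Combine two non-negative coordinates into one Z-order key.

    Bits of x occupy the even positions and bits of y the odd positions.
    """
    if x < 0 or y < 0:
        raise ValueError("coordinates must be non-negative")
    return interleave_bits(x, bits) | (interleave_bits(y, bits) << 1)


def morton_decode(code: int, bits: int = 16) -> tuple:
    """Recover (x, y) from a key produced by morton_encode."""
    x = 0
    y = 0
    for i in range(bits):
        x |= ((code >> (2 * i)) & 1) << i
        y |= ((code >> (2 * i + 1)) & 1) << i
    return x, y


key = morton_encode(3, 5)
print(key)                 # 39 (binary 100111)
print(morton_decode(key))  # (3, 5)
```

Sorting rows by such a key tends to place points that are close in two dimensions near each other in one dimension, which is what makes the technique useful for spatial indexing.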
DuckDB Extension Use Cases
Some of the ways that our products have been used are:
Real-time Analytics Pipeline: Combine Tributary (Kafka integration) + Radio (WebSocket events) + Bitfilters (duplicate detection) for live data processing.
Data Quality Workflow: Use Rapidfuzz (fuzzy matching) + Hashfuncs (fingerprinting) + Stochastic (statistical validation) to clean and validate datasets.
High-Performance ETL: Leverage Airport (Arrow Flight) + ShellFS (shell integration) + Crypto (data integrity) for secure, fast data movement.
📦 Custom Products
Need something specific?
We build custom DuckDB extensions and tools tailored to your needs. Whether it's a file format, a data source, or a connector, we can help.
📬 hello@query.farm
Be the First to Know at Query.Farm
Get exclusive access to new features, SQL tricks, and exciting announcements delivered right to your inbox. We only send emails when there's something awesome.