Tools & Products

At Query.Farm, weโ€™re building powerful tools and services that extend the capabilities of DuckDB โ€” making it faster, more flexible, and easier to integrate into production systems. Our extensions help teams process terabytes of data 10x faster, build real-time analytics pipelines, and solve complex data challenges with SQL.

We create open source extensions, developer utilities, and commercial software for teams working with embedded analytics.

๐Ÿ› ๏ธ Our Products

Our DuckDB extensions have been downloaded 2,000,000+ times by data teams worldwide.

Data Engineering & ETL

  • Airport: Connect DuckDB to remote data sources via Apache Arrow Flight for high-performance data access and sharing. Airport allows efficient querying of remote data from any any data format.

  • ShellFS: Stream data in/out of DuckDB using shell commandsโ€”perfect for ETL, automation, and UNIX-style workflows.

Streaming & Real-Time Analytics

  • Tributary: Integrate DuckDB with Apache Kafka for real-time querying and analysis of streaming data.

  • Radio: Enable DuckDB to send/receive events via WebSockets and Redis Pub/Sub.

Probabilistic & Approximate Analytics

  • Bitfilters: Fast, space-efficient set membership and duplicate detection using advanced probabilistic filters.

  • Datasketches: Scalable approximate analytics (distinct counts, quantiles, set operations) with Apache DataSketches.

Search, Matching & Completion

  • Fuzzycomplete: Fuzzy SQL completion for intuitive, context-aware suggestions.

  • Rapidfuzz: High-performance fuzzy string matching for deduplication, search, and data cleaning.

  • Marisa: Fast string lookups and prefix searches using MARISA tries.

Hashing & Security

  • Hashfuncs: Non-cryptographic hash functions for indexing, partitioning, and Bloom filters.

  • Crypto: Cryptographic hash functions and HMAC for data integrity and authentication.

Statistical Analysis

  • Stochastic: Comprehensive statistical distribution functions for probability, sampling, and analytics.

Visualization

  • Textplot: Text-based data visualizationโ€”bar charts, density plots, and more directly in SQL.

Advanced SQL Logic & Spatial Data

  • EvalExpr_Rhai: Inline scripting with Rhai for custom logic and dynamic calculations in SQL.

  • Lindel: Spatial indexing and multi-dimensional data linearization using Hilbert and Morton/Z-Order curves.

DuckDB Extension Use Cases

Some of the ways that our products have been used are:

Real-time Analytics Pipeline: Combined Tributary (Kafka integration) + Radio (WebSocket events) + Bitfilters (duplicate detection) for live data processing.

Data Quality Workflow: Use Rapidfuzz (fuzzy matching) + Hashfuncs (fingerprinting) + Stochastic (statistical validation) to clean and validate datasets.

High-Performance ETL: Leverage Airport (Arrow Flight) + ShellFS (shell integration) + Crypto (data integrity) for secure, fast data movement.

๐Ÿ“ฆ Custom Products

Need something specific?

We build custom DuckDB extensions and tools tailored to your needs. Whether itโ€™s a file format, a data source, or a connector, we can help.

๐Ÿ“ฌ hello@query.farm


Be the First to Know at Query.Farm

Get exclusive access to new features, SQL tricks, and exciting announcements delivered right to your inbox. We only send emails when thereโ€™s something awesome.