Introducing the New QuantFlow Website
We're excited to introduce the new QuantFlow website — a platform designed to communicate both the system we are building and the ideas behind it.
This is not just a product site. It is a place where system design, quantitative research, and practical implementation come together.
Why a New Website?
QuantFlow sits at the intersection of data engineering, quantitative research, and machine learning.
To properly understand its value, it's not enough to describe features — we also need to explain:
- how the system is designed
- why certain architectural decisions were made
- how it fits into real-world quantitative workflows
What We'll Share
1. System Design
We will provide detailed insights into how QuantFlow is built across all four components:
DataInfra — the engine-agnostic data foundation:
- Multi-source ingestion with declarative feed provider YAML
- Common Data Model (CDM) with Pydantic validation
- QFSQL — an engine-agnostic SQL dialect for field mappings, compiling to BigQuery, Snowflake, DuckDB, and PostgreSQL
- Auto-generated dbt pipelines and four-layer data quality enforcement
MarketState — market structure reconstruction:
- 8 bar types (fixed + information-driven) via a single-pass Numba fused kernel
- Order book snapshot reconstruction from tick data
- Label Engine with triple barrier, fixed horizon return, trend scanning, and time-series labeling
FeatureDAG — the compiler for quantitative features:
- Formula Language — a mathematical DSL with ~40 functions compiled to an IR DAG via Python's
astmodule - 125+ FeatureTypes and 14 MFP packs across 6 dimensions
- 4-stage pipeline: AST compiler → IR DAG → lowering → execution
- 50+ compile-time schema contracts catch errors before any data is touched
Execution Layer — dual-backend runtime:
- Batch (Polars) for research — lazy evaluation, Arrow zero-copy, in-process deployment
- Streaming (DolphinDB) for live trading — deploy-and-forget, sub-ms latency, consolidated engines
- Mode polymorphism: tick / bar / tick_to_bar
- Dagster orchestrates the batch pipeline with 5-stage asset lineage and per-stage retries
2. Product and Business Perspective
Beyond the system itself, we will discuss:
- how quantitative teams build and scale research pipelines
- the challenges of data fragmentation and feature engineering
- where QuantFlow fits within the broader quant ecosystem
- design trade-offs between flexibility, performance, and usability
3. Theoretical Foundations
We will also explore the underlying concepts that inform the system:
- market microstructure and event-driven data
- financial data modeling and time alignment
- feature engineering for machine learning
- causality and leakage prevention
Our Goal
The goal of this platform is to bridge:
- system design and real-world usage
- practical engineering and theoretical understanding
We aim to make QuantFlow not only a tool, but also a reference point for how modern quantitative systems are built.
Explore
- System Overview — architecture and component design
- Feature Library — 125+ FeatureTypes and 14 MFP packs
- QFDSL Reference — QFSQL and Formula Language references
- Quickstart — get started in 5 minutes
We're building QuantFlow as both a system and a framework for thinking about quantitative finance.
— The QuantFlow Team