Query Structure
DUQL is a powerful and intuitive query language designed for data transformation and analysis. It provides a structured way to define complex data operations using a human-readable YAML format.
Syntax
A fully defined DUQL query has the following structure:
Components
Settings
version
string
No
The version of DUQL being used
target
string
No
The target database or SQL dialect for the query
Example:
Declare
The declare
section allows you to define variables, functions, or reusable query components.
Example:
Dataset
The dataset
component specifies the main data source for your query. It can be a simple table name or a more complex definition.
Example:
or
Steps
The steps
section is where you define your data transformation pipeline. Each step represents an operation on your data.
Available steps include:
filter
: Select rows based on conditionsjoin
: Combine data from multiple sourcesselect
: Choose or compute columnsgroup
: Aggregate datasort
: Order resultstake
: Limit the number of rowsgenerate
: Create new columnsAnd more...
Example:
Into
The into
component specifies the destination for your query results.
Example:
Best Practices
🏗️ Structure your query logically, starting with data sources and progressing through transformations.
🧩 Use the
declare
section to create reusable components and improve query readability.📊 Leverage the power of DUQL's expressive steps to perform complex data operations.
🎯 Be explicit in your joins and filters to ensure data integrity.
🚀 Optimize your query by placing filters early in the pipeline and using appropriate indexes.
📝 Use meaningful names for variables and result sets to enhance query understandability.
🔍 Take advantage of DUQL's ability to work with various data sources and formats.
Real-World Use Case
Here's an example of a DUQL query that analyzes customer purchasing behavior:
This query:
Defines recent customers
Joins order data with customer and product information
Calculates total amount per order
Groups and summarizes purchases by customer, month, and product category
Sorts the results
Categorizes customers based on their spending
The resulting customer_purchase_analysis
dataset provides valuable insights into customer behavior, allowing for targeted marketing strategies and improved customer relationship management.
Last updated
Was this helpful?