Declare
The declare
component in DUQL allows you to define reusable elements such as variables, functions, and subqueries. These declarations can be referenced throughout your DUQL query, promoting code reuse and improving query readability.
Syntax
The declare
section is typically placed at the beginning of a DUQL query and can contain multiple declarations:
Types of Declarations
Simple Variables
You can declare simple variables with literal values or expressions.
Example:
Functions
Functions can be declared using a simplified arrow syntax or a more detailed YAML structure.
Simple syntax:
Detailed syntax:
Subqueries
Subqueries are declared as complete DUQL pipelines that can be reused in your main query.
Example:
Examples
Mixed Declarations
Using Declarations in a Query
Best Practices
📝 Use clear and descriptive names for your declarations to improve query readability.
🔄 Leverage declarations to avoid repetition in your queries.
🧩 Break down complex logic into smaller, reusable functions.
🏗️ Use subquery declarations to create modular and maintainable query components.
📊 Consider performance implications when using complex subqueries in declarations.
🔍 Document your declarations, especially for complex functions or subqueries.
Real-World Use Case
Here's an example of a DUQL query that makes extensive use of declarations:
This query demonstrates:
Declaring constants (
tax_rate
,shipping_threshold
)Defining reusable functions (
calculate_total
,apply_shipping
)Creating subquery declarations (
recent_customers
,product_categories
)Using these declarations throughout the main query for calculations, filtering, and joins
The resulting top_customer_category_analysis
dataset provides insights into the top-spending customers by product category, incorporating tax and shipping calculations.
💡 Tip: The
declare
section is a powerful tool for creating reusable and maintainable DUQL queries. Use it to define your business logic once and apply it consistently throughout your data analysis pipeline!
Into
The into
component specifies the destination for your query results. It's similar to declaring a variable, but it occurs at the end of a query pipeline. When you use into
, you're essentially creating a named result set that can be referenced in subsequent queries or operations.
Key points about into
:
It functions like a variable declaration that happens at the end of a query.
The result of all preceding steps in the query is stored in the named variable specified by
into
.This named result can be used as a dataset in other DUQL queries within the same session or script.
Example:
In this example, monthly_sales_report
becomes a named dataset containing the results of the query. You can then use it in subsequent queries like this:
The key difference between into
and a variable declared in the declare
section is the timing and context:
Variables in
declare
are defined before the main query pipeline and can be used throughout the query.into
creates a named result at the end of the query pipeline, making the final result available for future use.
Think of into
as a way to save your query results for further analysis or as building blocks for more complex data operations. It's particularly useful when you want to break down a complex analysis into multiple, manageable DUQL queries.
Last updated
Was this helpful?