Join
The join
function in DUQL is used to combine rows from two or more tables based on a related column between them. It supports different types of joins and allows for complex join conditions.
Syntax
Parameters
dataset
object
Yes
-
The data source to join with (ref: dataset.s.duql.yml)
where
string
Yes
-
The join condition
retain
string
No
"inner"
The type of join to perform
Behavior
Combines rows from the current dataset with rows from the specified dataset based on the join condition.
The
dataset
parameter can include its ownsteps
for preprocessing before the join.The
where
condition supports a==
shorthand for equality joins on matching column names.The
retain
parameter specifies the join type, defaulting to an inner join.
Join Types
"inner"
Returns only the matched rows (default)
"left"
Returns all rows from the left table and matched rows from the right table
"right"
Returns all rows from the right table and matched rows from the left table
"full"
Returns all rows when there is a match in either left or right table
Examples
Basic Join
Left Join
Join with CSV File
Join with Subquery and Full Outer Join
Complex Join with Aggregation
Join with SQL Query
Join with External Data Source
Best Practices
🎯 Always specify the join condition explicitly for clarity.
🤔 Choose the appropriate join type using the
retain
parameter based on your data requirements.🚀 Utilize the
steps
within thedataset
parameter for preprocessing before joining.🔍 Use the
==
shorthand for simple equality joins when column names match.📊 Be mindful of data volume when performing joins, especially with large datasets or complex subqueries.
🏷️ Consider using aliases or clear naming conventions when joining multiple tables to avoid ambiguity.
The join
operations allow us to bring together data from multiple sources, creating a rich dataset for analysis that includes sales performance, customer behavior, inventory status, and category-level benchmarks.
💡 Tip: The
join
function in DUQL is highly flexible, allowing you to combine data from various sources and formats. Leverage its ability to includesteps
within thedataset
parameter to preprocess your data before joining, enabling more sophisticated data integration in your analyses!
Last updated
Was this helpful?