This article demonstrates how to create new data joins in the dataset Flight Delays, based on data previously imported into Arcadia from the datafile flights.csv. The additional data files are airline-id.csv, airport-codes.csv, cancellation-code.csv, and airport-lat-long.csv.
The following steps demonstrate how to make new joins.
On the main navigation bar, click Data.
The Data view appears, open on the Datasets tab.
Create a new dataset based on the datafile flights.csv.
Find the dataset in the list of datasets, either by scrolling or by using search, and click on it.
Dataset side navigation appears, open at Dataset Detail view.
In the side navigation menu, click Data Model.
Data Model view appears, and shows the name of the only table in the dataset. You may click Show Data to display the data of that table.
Click Edit Data Model to edit the data model.
Click the sign on the table representation.
The Table Browser modal window appears.
In the Table Browser modal window, make the following selections:
In the Database Name selector, choose the data source.
Note that you can join tables from different databases. This value is pre-populated to match the dataset's existing table, but it may be changed.
In the Table Name selector, choose the table name airline_id
.
This value is pre-populated to match the existing table of the dataset, but it may be changed.
The Edit Join modal window appears.
In the Edit Join modal window, select the matching columns for both tables.
These values are pre-populated by default when there is a natural match between the two tables (like identical field name and value types), but they may be changed.
AIRLINE_ID
. On the right side, select the field code
.Repeat the previous three steps for the remaining tables:
airport_codes
has two joins, for source column
ORIGIN
= target column code
, and source column
DEST
= target column code
. After applying the first
join, click Add Join Pair, and then specify the second join.cancellation_code
has a join for source column CANCELLATION_CODE
= target column code
.airport_lat_long
has two joins, for source column ORIGIN
= target column locationid
, and source column DEST
= target column locationid
.Click Save.