This article demonstrates how to create new data joins in a dataset.
To demonstrate how to create new data joins, we used dataset Flight Delays, based on data previously imported into Arcadia from the datafile flights-2014.zip
The additional datafiles are airline-id.csv, airport-codes.csv, cancellation-code.csv, airport-lat-long.csv, and state-abbreviations.csv.
The following steps demonstrate how to make new joins:
On the main navigation bar, click Data.
The Data view appears, open on the Datasets tab.
Create a new dataset based on the datafile flights_2014.csv (compressed).
Find the dataset in the list of datasets, either by scrolling or by using search, and click on it.
Dataset side navigation appears, open at Dataset Detail view.
In the side navigation menu, click Data Model.
The Data Model view appears, and shows the name of the only table in the dataset. You may click Show Data to display the data of that table.
Click Edit Data Model to edit the data model.
Click the sign on the table representation.
The Table Browser modal window appears.
In the Table Browser modal window, make the following selections:
In the Database Name selector, choose the database documentation.
Note that you can join tables from different databases. This value is pre-populated to match the dataset's existing table, but it may be changed.
In the Table Name selector, choose the table name
airline_id
.
This value is pre-populated to match the existing table of the dataset, but it may be changed.
The Edit Join modal window appears.
In the Edit Join modal window, the following options are available:
airlineid
. On the right side, select the field
code
.Click Apply.
Repeat the previous two steps to create seven more joins as follows:
The table airport_codes
has two joins to the main table, and you
must create each join separately as follows:
origin
= right column code
.dest
= right column code
.cancellation_code
has a join for left column
cancellationcode
= right column code
.airport_lat_long
has two joins to the main table, and you
must create each join separately as follows: origin
= right column
locationid
.dest
= right column locationid
.state_abbreviations
has two joins to the main table, and
you must create each join separately:deststate
= right column
abbreviation
.originstate
= right column
abbreviation
.This step is optional, and depends on whether your flights_*
table has fully extended state names.