Load from Azure Blob Storage
TigerGraph Savanna allows you to load data directly from Azure Blob Storage. This guide will walk you through the steps to configure and load data from Azure Blob Storage into your TigerGraph workspace.
1) Select Source
Once you’ve selected Azure Blob Storage, you will be asked to configure the Azure Blob Storage data source. . Click on to add a new Azure Blob Storage data source.
+
-
You will need to provide your
ABS Connection String
. -
Once you have those configured, you can add one or multiple
ABS URI
within the same blob storage account. -
Click Next to process the file(s).
The current data loading tool only supports CSV,TSV and JSON files. Other formats will be available in later releases.
2) Configure File
This step lets you configure the source file details.
. The data loading tool will automatically detect the .csv
separators and line breaks.
The parser automatically splits each line into a series of tokens.
+ image::config-file.png[]
+
If the parsing is not correct, click on the
button to configure a different option for the delimiter, such as
|
+ The enclosing character is used to mark the boundaries of a token, overriding the delimiter character.
+
For example, if your delimiter is a comma, but you have commas in some strings, then you can define single or double quotes as the enclosing character to mark the endpoints of your string tokens.
+
It is not necessary for every token to have enclosing characters. The parser uses enclosing characters when it encounters them. |
+
You can edit the header line of the parsing result to give each column a more intuitive name, since you will be referring to these names when loading data to the graph. The header name is ignored during data loading. |
-
Once you are satisfied with the file settings, click Next to proceed.
3) Configure Map
If you are loading data into a brand new graph, you will be prompted to let our engine generate a schema and mapping for you. Or you can start from scratch. For more details of schema design please refer to Design Schema.
. Select Generate the schema only
or Generate the schema and data mapping
.
+ image::Screenshot 2024-04-17 at 5.55.35 PM.png[]
+
The schema generation feature is still a preview feature. The correctness and efficiency of the resulting graph schema and mapping could vary. |
-
In the
Source
column, you can choose the specific column from the data source that you want to map with the attribute. -
Use the
+
button to create a new attribute of the target vertex or edge. -
Click the Token Function button to configure token functions for the selected source. For more details of configuring token functions, please refer to Token Function.
-
Click the Quick Map button to quickly map the data source headers to the existing schema attributes.
-
The Map all to target button aligns existing attribute names with the corresponding data source headers, it won’t introduce new attributes.
-
The Map all from source button not only aligns existing attribute names with the corresponding dat asource headers, but also introduces new attributes based on unmatched data source headers.
-
The following list shows the mapping status of each attribute, you can manually adjust the mapping by checking the box next to the attribute name.
-
-
Click Next to proceed.
4) Confirm
This step will let you confirm the changes made to the schema and the data mapping you created to load the data.
. Simply review the Schema to be changed
and Data to be loaded
lists.
+ image::confirm.png[]
+
Please be aware that some schema changes will result in unintentional deletion of the data. Please carefully review the warning message before confirming the loading. |
-
Click on the Confirm button to run the loading jobs and monitor their
Status
.
Next Steps
Next, learn how to use Design Schema, GSQL Editor and Explore Graph in TigerGraph Savanna.
Or return to the Overview page for a different topic.