After mapping data files to the graph schema, you can start loading data. Click "Load Data" on the left side menu bar to go to the Load Data page.
The "Load Data" interface is separated into three parts:
Data Mapping Overview
Provides a general view of the graph and the data mapping.
Shows the loading progress of each data file.
Toolbar (above Data Mapping)
Start/pause/resume/stop data loading and clear graph data buttons.
Graph statistics: displays the numbers of vertices and edges in total and per type, with real-time loading progress.
Loading statistics: displays the total number of vertices and edges loader vs. time.
GraphStudio provides two types of loading:
Partial Loading: load a subset of the data files which the user selects.
Full Loading: load all of the data files.
Select one or more data files (holding down the "shift" key to select multiple data files), and click on the "start loading" buttonon the toolbar.
Click on a blank space in the data mapping overview panel to unselect the data sources, and click on the "start/resume loading" buttonon the toolbar. While loading is in progress a green hatched bar will appear over each data file to show its real time progress.
Similar to Start Loading, you can pause loading some of the data files, or all loading data files.
Select one or more data files (holding down the "shift" key to select multiple data files), and click on the "pause loading" buttonon the toolbar. In the Paused state, the progress bar will change to a solid orange color.
You can resume loading some or all loading data files which have been paused.
Select one or more data files (holding down the "shift" key to select multiple data files), and click on the "start/resume loading" buttonon the toolbar. After resuming, the data file loading will continue from where it was paused:
After loading has been started or paused, you can stop loading from these data files by clicking the "stop load" button. Similar to Start Loading, you can stop loading some or all loading data files. After stopping, the loading status of the data files will become "Stopped":
The Statistics panel contains two tabs: Graph Statistics (1st tab) and Data Loading Statistics (2nd tab).
By default if no data file is selected, the Statistics panel will show Graph Statistics.
The table at the top shows the total number of vertices and edges in the current graph, and the number of each vertex type and edge type as well. The line chart at the bottom shows the number of vertices and edges over time, when loading is in progress.
If you click on one data file, the Statistics panel will change to show Data Loading Statistics:
The table at the top shows the detailed loading information of the selected data file, including:
Status (RUNNING, PAUSED, STOPPED, etc)
Loaded percentage (for files on server) or loaded size (for S3 file)
Average loading speed
Number of loaded lines
Number of missing token lines
Number of oversize lines
Loading start time
The area chart in the middle shows the real-time loading speed (lines per second) for this data file.
The pie chart at the bottom shows the distribution of data lines, among three categories:
Missing token lines (the lines contain fewer tokens than required by the data mapping)
Oversize lines (some tokens are too large)
If data file loading encounters any issues and gets an error message, the error message will be shown at the bottom:
Click on the "clear graph data" buttonon the toolbar to clear the graph data. This operation will take approximately 1 minute or more, depending on the size of your graph and the hardware.
After the clear operation, the graph vertex and edge number statistics will both drop to 0.
After data has been loaded, you can go to the Explore Graph or Write Queries pages.