Query Functions

The functions on this page run installed or interpreted queries in TigerGraph. All functions in this module are called as methods on a TigerGraphConnection object.

Methods

showQuery()

showQuery(queryName: str) → str

Returns the text of the given GSQL query as a string.

Parameter:

queryName (str): Name of the query whose text is returned.

getQueryMetadata()

getQueryMetadata(queryName: str) → dict

Returns metadata details about a query. Specifically, it lists the input parameters in the same order as they exist in the query and outputs PRINT statement syntax.

Parameter:

queryName (str): Name of the query to get metadata of.

Endpoints:

POST /gsqlserver/gsql/queryinfo (In TigerGraph versions 3.x) See Get query metadata
POST /gsql/v1/queries/signature (In TigerGraph versions 4.x)

getInstalledQueries()

getInstalledQueries(fmt: str = "py") → Union[dict, str, pd.DataFrame]

Returns a list of installed queries.

Parameter:

fmt: Format of the results:
- "py": Python objects (default)
- "json": JSON document
- "df": pandas DataFrame

Returns:

The names of the installed queries.

runInstalledQuery()

runInstalledQuery(queryName: str, params: Union[str, dict] = None, timeout: int = None, sizeLimit: int = None, usePost: bool = False, runAsync: bool = False, replica: int = None, threadLimit: int = None, memoryLimit: int = None) → list

Runs an installed query.

The query must be already created and installed in the graph. Use getEndpoints(dynamic=True) or GraphStudio to find out the generated endpoint URL of the query. Only the query name needs to be specified here.

Parameters:

queryName: The name of the query to be executed.
params: Query parameters. A string of param1=value1&param2=value2 format or a dictionary. See below for special rules for dictionaries.
timeout: Maximum duration for successful query execution (in milliseconds). See GSQL query timeout
sizeLimit: Maximum size of response (in bytes). See Response size
usePost: Defaults to False. RESTPP accepts a maximum URL length of 8192 characters. Use POST if additional parameters would cause the URL to exceed this limit, or if you need to pass an empty set into a query (supported in database versions >= 3.8).
runAsync: Run the query in asynchronous mode. See Async operation
replica: If your TigerGraph instance is an HA cluster, specify which replica to run the query on. Must be a value between [1, (cluster replication factor)]. See Specify replica
threadLimit: Specify a limit of the number of threads the query is allowed to use on each node of the TigerGraph cluster. See Thread limit
memoryLimit: Specify a limit to the amount of memory consumed by the query (in MB). If the limit is exceeded, the query will abort automatically. Supported in database versions >= 3.8. See Memory limit
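The URL length limit mentioned for usePost can be checked before the call. The following is a minimal sketch, not part of pyTigerGraph: needs_post and the base URL are hypothetical, and the length is only an estimate because the library may encode dictionary parameters differently from urlencode.

```python
from urllib.parse import urlencode

MAX_URL_LENGTH = 8192  # RESTPP limit quoted in the usePost description


def needs_post(base_url: str, params: dict) -> bool:
    """Estimate whether base_url plus the encoded params exceeds the limit.

    Hypothetical helper: base_url stands for the query endpoint URL.
    """
    # doseq=True expands list values into repeated key=value pairs.
    query_string = urlencode(params, doseq=True)
    # The extra 1 accounts for the "?" between the path and the query string.
    return len(base_url) + 1 + len(query_string) > MAX_URL_LENGTH
```

The result could then be passed on, for example as usePost=needs_post(url, params) in the runInstalledQuery() call.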

Returns:

The output of the query, a list of output elements (vertex sets, edge sets, variables, accumulators, etc.).

Notes:

When specifying parameter values in a dictionary:

For primitive parameter types use
"key": value
For SET and BAG parameter types with primitive values, use
"key": [value1, value2, …]
For VERTEX<type> use
"key": primary_id
For VERTEX (no vertex type specified) use
"key": (primary_id, "vertex_type")
For SET<VERTEX<type>> use
"key": [primary_id1, primary_id2, …]
For SET<VERTEX> (no vertex type specified) use
"key": [(primary_id1, "vertex_type1"), (primary_id2, "vertex_type2"), …]

Endpoints:

GET /query/{graph_name}/{query_name} See Run an installed query (GET)
POST /query/{graph_name}/{query_name} See Run an installed query (POST)
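The dictionary rules in the notes for runInstalledQuery() can be sketched as follows. The query name, parameter names, vertex types and IDs are all hypothetical; conn is assumed to be an existing TigerGraphConnection.

```python
from typing import Any


def run_example_query(conn: Any) -> list:
    """Call a hypothetical installed query using each documented value shape."""
    params = {
        "max_hops": 3,                          # primitive: plain value
        "tags": ["a", "b"],                     # SET/BAG of primitives: list
        "seed": "p1",                           # VERTEX<type>: primary ID only
        "any_seed": ("p1", "Person"),           # untyped VERTEX: (ID, type) tuple
        "seeds": ["p1", "p2"],                  # SET<VERTEX<type>>: list of IDs
        "mixed": [("p1", "Person"), ("c1", "Company")],  # SET<VERTEX>: list of tuples
    }
    # timeout is given in milliseconds, as described in the parameter list.
    return conn.runInstalledQuery("example_query", params=params, timeout=30_000)
```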

checkQueryStatus()

checkQueryStatus(requestId: str = "")

Checks the status of the queries running on the graph specified in the connection.

Parameter:

requestId (str, optional): String ID of the request. If empty, returns all running requests. See Check query status (detached mode)

Endpoint:

GET /query_status/{graph_name} See Check query status (detached mode)

getQueryResult()

getQueryResult(requestId: str = "")

Gets the result of a detached query.

Parameter:

requestId (str): String ID of the request. See Check query results (detached mode)
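A detached run typically combines runInstalledQuery(runAsync=True), checkQueryStatus() and getQueryResult(). The sketch below shows one possible polling loop. It rests on two assumptions that the text above does not confirm: that the asynchronous call returns the request ID, and that checkQueryStatus() returns a list of dictionaries with a "status" field whose value is "running" while the query is in progress. run_detached is a hypothetical helper.

```python
import time
from typing import Any


def run_detached(conn: Any, query_name: str, params: dict,
                 poll_seconds: float = 1.0, max_polls: int = 60) -> Any:
    """Start a query in asynchronous mode and wait for its result."""
    # ASSUMPTION: with runAsync=True the call returns the request ID.
    request_id = conn.runInstalledQuery(query_name, params=params, runAsync=True)
    for _ in range(max_polls):
        statuses = conn.checkQueryStatus(request_id)
        # ASSUMPTION: each entry is a dict with a "status" field.
        still_running = any(entry.get("status") == "running" for entry in statuses)
        if not still_running:
            return conn.getQueryResult(request_id)
        time.sleep(poll_seconds)
    raise TimeoutError(f"query {request_id} still running after {max_polls} polls")
```

Verify both assumptions against the responses of your own TigerGraph version before relying on this loop.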

runInterpretedQuery()

runInterpretedQuery(queryText: str, params: Union[str, dict] = None) → list

Runs an interpreted query.

Use $graphname or @graphname@ in the FOR GRAPH clause to avoid hardcoding the name of the graph in your app. It will be replaced by the actual graph name.

Parameters:

queryText: The text of the GSQL query that must be provided in this format:

INTERPRET QUERY (<params>) FOR GRAPH <graph_name> {
<statements>
}

params: A string of param1=value1&param2=value2… format or a dictionary. See below for special rules for dictionaries.

Returns:

The output of the query, a list of output elements such as vertex sets, edge sets, variables and accumulators.

Notes:

When specifying parameter values in a dictionary:

For primitive parameter types use
"key": value
For SET and BAG parameter types with primitive values, use
"key": [value1, value2, …]
For VERTEX<type> use
"key": primary_id
For VERTEX (no vertex type specified) use
"key": (primary_id, "vertex_type")
For SET<VERTEX<type>> use
"key": [primary_id1, primary_id2, …]
For SET<VERTEX> (no vertex type specified) use
"key": [(primary_id1, "vertex_type1"), (primary_id2, "vertex_type2"), …]

Endpoints:

POST /gsqlserver/interpreted_query (In TigerGraph versions 3.x) See Run an interpreted query
POST /gsql/v1/queries/interpret (In TigerGraph versions 4.x)
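As an illustration of the queryText format and the $graphname placeholder, here is a minimal sketch. The query itself and the helper run_sum are hypothetical; conn is assumed to be an existing TigerGraphConnection.

```python
from typing import Any

# $graphname is left as a placeholder; per the description above it is
# replaced by the graph name of the connection.
QUERY_TEXT = """
INTERPRET QUERY (INT a, INT b) FOR GRAPH $graphname {
  PRINT a + b AS total;
}
"""


def run_sum(conn: Any, a: int, b: int) -> list:
    """Run the interpreted query, passing primitive parameters as a dictionary."""
    return conn.runInterpretedQuery(QUERY_TEXT, params={"a": a, "b": b})
```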

getRunningQueries()

getRunningQueries() → dict

Reports the statistics of currently running queries on the graph.

abortQuery()

abortQuery(request_id: Union[str, list] = None, url: str = None)

Safely aborts a selected query by ID, or all queries of an endpoint by the endpoint URL of a graph. If neither request_id nor url is specified, all queries currently running on the graph are aborted.

Parameters:

request_id (str, list, optional): The ID, or list of IDs, of the queries to abort. If set to "all", all running queries are aborted.
url (str, optional): The URL of the endpoint whose running queries should be aborted.

parseQueryOutput()

parseQueryOutput(output: list, graphOnly: bool = True) → dict

Parses query output and separates vertex and edge data (and optionally other output) for easier use.

Parameters:

output: The data structure returned by runInstalledQuery() or runInterpretedQuery().
graphOnly: If True (the default setting), restricts captured output to vertices and edges. If False, captures values of variables and accumulators and any other plain text printed.

Returns:

A dictionary with two (or three) keys: "vertices", "edges" and optionally "output". Each of the first two maps to another dictionary, keyed by the vertex or edge types found and holding the instances of each type. "output" is a list of dictionaries containing the key/value pairs of any other output.

The JSON output from a query can contain a mixture of results: vertex sets (the output of a SELECT statement), edge sets (e.g. collected in a global accumulator), printout of global and local variables and accumulators, including complex types (LIST, MAP, etc.). The type of the various output entries is not explicit and requires manual inspection to determine the type.

This function "cleans" this output, separating and collecting vertices and edges in an easy-to-access way. It can also collect other output or ignore it.
The output of this function can be used, for example, with the vertexSetToDataFrame() and edgeSetToDataFrame() functions or (after some transformation) to pass a subgraph to a visualization component.
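To show how the returned dictionary might be consumed, the sketch below counts the instances per vertex and edge type. count_by_type is a hypothetical helper; it relies only on the "vertices" and "edges" keys described above and on each type's collection supporting len().

```python
def count_by_type(parsed: dict) -> dict:
    """Count the instances of each vertex and edge type in parsed query output."""
    return {
        "vertices": {vtype: len(items) for vtype, items in parsed.get("vertices", {}).items()},
        "edges": {etype: len(items) for etype, items in parsed.get("edges", {}).items()},
    }
```

It would typically be called as count_by_type(conn.parseQueryOutput(results)), where results comes from runInstalledQuery() or runInterpretedQuery().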

getStatistics()

getStatistics(seconds: int = 10, segments: int = 10) → dict

Retrieves real-time query performance statistics over the given time period.

Parameters:

seconds: The duration of the statistics collection period (the last n seconds before the function call).
segments: The number of segments of the latency distribution (shown in results as LatencyPercentile). By default, segments is 10, meaning the percentile range 0-100% will be divided into ten equal segments: 0%-10%, 11%-20%, etc. This argument must be an integer between 1 and 100.
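The arithmetic behind segments can be sketched as below. segment_bounds is a hypothetical helper that only illustrates how the 0-100% range splits into equal parts and enforces the documented 1 to 100 range; the exact boundary labels in the server response may differ.

```python
def segment_bounds(segments: int = 10) -> list:
    """Return (lower, upper) percentile bounds for each latency segment."""
    if not 1 <= segments <= 100:
        raise ValueError("segments must be an integer between 1 and 100")
    width = 100 / segments  # width of each segment in percentage points
    return [(i * width, (i + 1) * width) for i in range(segments)]
```

With segments=4, each segment covers 25 percentage points, and a matching call would be conn.getStatistics(seconds=10, segments=4).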

Endpoint:

GET /statistics/{graph_name} See Show query performance

describeQuery()

describeQuery(queryName: str, queryDescription: str, parameterDescriptions: dict)

Adds a query description and parameter descriptions. Only supported on versions of TigerGraph >= 4.0.0.

Parameters:

queryName: The name of the query to describe.
queryDescription: A description of the query.
parameterDescriptions (optional): A dictionary of parameter descriptions. The keys are the parameter names and the values are the descriptions.

Returns:

The response from the database.
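A call might look like the following sketch. The query name, parameter names and description texts are hypothetical; conn is assumed to be an existing TigerGraphConnection on TigerGraph 4.0.0 or later.

```python
from typing import Any


def document_example_query(conn: Any) -> Any:
    """Attach a description to a hypothetical query and to two of its parameters."""
    # Keys are parameter names, values are their descriptions.
    parameter_descriptions = {
        "seed": "Primary ID of the vertex to start from.",
        "max_hops": "Maximum number of hops to traverse.",
    }
    return conn.describeQuery(
        "example_query",
        "Finds vertices within a number of hops of a seed vertex.",
        parameter_descriptions,
    )
```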

Endpoints:

PUT /gsqlserver/gsql/description?graph={graph_name} (In TigerGraph version 4.0)
PUT /gsql/v1/description?graph={graph_name} (In TigerGraph versions >4.0)

getQueryDescription()

getQueryDescription(queryName = "all")

Gets the description of a query. Only supported on versions of TigerGraph >= 4.0.0.

Parameter:

queryName: The name of the query to get the description of. If multiple query descriptions are desired, pass a list of query names. If set to "all", returns the description of all queries.

Returns:

The description of the query(ies).

Endpoints:

GET /gsqlserver/gsql/description?graph={graph_name} (In TigerGraph version 4.0)
GET /gsql/v1/description?graph={graph_name} (In TigerGraph versions >4.0)

dropQueryDescription()

dropQueryDescription(queryName: str, dropParamDescriptions: bool = True)

Drops the description of a query. Only supported on versions of TigerGraph >= 4.0.0.

Parameters:

queryName: The name of the query to drop the description of. If set to "*", drops the description of all queries.
dropParamDescriptions: Whether to drop the parameter descriptions as well. Defaults to True.

Returns:

The response from the database.

Endpoints:

DELETE /gsqlserver/gsql/description?graph={graph_name} (In TigerGraph version 4.0)
DELETE /gsql/v1/description?graph={graph_name} (In TigerGraph versions >4.0)