Our documentation has moved!

Our documentation now lives on the Atlassian Support site at https://support.atlassian.com/analytics/resources/.

How is data structured in the Atlassian Data Lake?

Each product has its own set of tables and columns in the Atlassian Data Lake. If you choose to include user-generated content (UGC) for your products, more columns are added to each of those tables.

Only data for Jira Software, Jira Service Management, and Jira Work Management is available in the Atlassian Data Lake at this time. Data for more products is coming soon.

Learn more about the schema for the Jira family of products

The product tables capture product data in a star schema, meaning some tables refer to other tables. Because of this, you may need to join multiple queries to get the data you need in Atlassian’s analytics offering.

These tables rapidly reflect changes in your products, typically within half an hour. This makes them especially useful for custom analysis, or when having the most up-to-date information is important.

Every table has a row_refreshed_at column that records when the row was last refreshed, which indicates how fresh the data is. For example, a timestamp of 2020-12-30T23:00:41.950817Z means that the information in the row is current as of 11 p.m. on December 30, 2020, in the UTC time zone.
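As a rough illustration of reading row_refreshed_at outside the product, the sketch below parses the example timestamp and checks it against a freshness window. The helper names (parse_refreshed_at, is_fresh), the fixed "current time", and the 30-minute window are assumptions made for this example and are not part of the Data Lake.

```python
from datetime import datetime, timedelta, timezone


def parse_refreshed_at(value: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp such as 2020-12-30T23:00:41.950817Z."""
    # Older Python versions reject a trailing "Z", so spell out the UTC offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def is_fresh(refreshed_at: datetime, now: datetime, max_age: timedelta) -> bool:
    """Return True if the row was refreshed within max_age of now."""
    return now - refreshed_at <= max_age


refreshed = parse_refreshed_at("2020-12-30T23:00:41.950817Z")

# Hypothetical "current time", fixed so the example is reproducible.
now = datetime(2020, 12, 30, 23, 20, tzinfo=timezone.utc)

print(refreshed.isoformat())
print(is_fresh(refreshed, now, timedelta(minutes=30)))
```

In real use, now would come from datetime.now(timezone.utc), and the window would depend on how current your analysis needs to be.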

Dates and timestamps in the Data Lake are in UTC

Keep in mind that all date and timestamp columns (for example, created_at and updated_at) are in the UTC time zone. To display them in a different time zone, change either your workspace time zone in your workspace settings or the individual dashboard's time zone in its dashboard settings.
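If you handle these values outside the product, the conversion is your responsibility. The sketch below is a minimal example that assumes a hypothetical created_at value and a fixed UTC-08:00 offset; a fixed offset ignores daylight saving time, so a named time zone is usually the better choice in practice.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical created_at value as stored in the Data Lake (UTC).
created_at_utc = datetime(2021, 2, 14, 23, 50, 39, tzinfo=timezone.utc)

# Fixed UTC-08:00 offset, chosen for illustration; it ignores daylight saving time.
local_zone = timezone(timedelta(hours=-8), name="UTC-08:00")

# astimezone() changes how the instant is displayed, not the instant itself.
created_at_local = created_at_utc.astimezone(local_zone)

print(created_at_local.isoformat())
```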

Workspace data

All tables in the schema have a workspace_id column. If you have multiple instances of Jira Cloud in your organization, you can use this column to distinguish between them.

SQL name for the table: workspace

Lists your Jira instances

Table uses the following columns:

Column | Data type | Example data | Description
workspace_id | String | xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx | 36-character unique site identifier. This is the primary key for workspace data.
name | String | tbd | The name of the Jira instance
row_refreshed_at | Datetime | 2021-02-14 23:50:39.984000000 | Last refresh time for this record (UTC time zone)
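To show how workspace_id can separate data from multiple Jira instances, the sketch below joins a product table to workspace using an in-memory SQLite database. The example_issue table, its issue_key column, and all sample rows are hypothetical stand-ins; only the workspace table and its columns come from the schema above, and the SQL dialect in your analytics environment may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE workspace (
        workspace_id TEXT PRIMARY KEY,
        name TEXT,
        row_refreshed_at TEXT
    );
    -- Hypothetical product table; only workspace_id is taken from the schema description.
    CREATE TABLE example_issue (
        issue_key TEXT,
        workspace_id TEXT
    );
    INSERT INTO workspace VALUES
        ('ws-1', 'Site A', '2021-02-14 23:50:39'),
        ('ws-2', 'Site B', '2021-02-14 23:50:39');
    INSERT INTO example_issue VALUES
        ('A-1', 'ws-1'),
        ('A-2', 'ws-1'),
        ('B-1', 'ws-2');
    """
)

# Count rows per Jira instance by joining on workspace_id.
rows = conn.execute(
    """
    SELECT w.name, COUNT(*) AS issue_count
    FROM example_issue AS i
    JOIN workspace AS w ON w.workspace_id = i.workspace_id
    GROUP BY w.name
    ORDER BY w.name
    """
).fetchall()

print(rows)
```

Grouping by the workspace name rather than the raw identifier keeps the result readable when several instances are present.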

Account data

SQL name for the table: account

Lists the Atlassian accounts within your organization

Table uses the following columns:

Column | Data type | Example data | Description
account_id | String | TBD | Atlassian account ID. This is the primary key for account data.
name | String | Charles Atlas | The name of the account holder
email | String | charlie@atlassian.com | The email address of the account holder