Our documentation now lives on the Atlassian Support site at https://support.atlassian.com/analytics/resources/.
How is data structured in the Atlassian Data Lake?
Each product has its own set of tables and columns in the Atlassian Data Lake. If you choose to include user-generated content (UGC) for your products, more columns are added to each of those tables.
Only data for Jira Software, Jira Service Management, and Jira Work Management is available in the Atlassian Data Lake at this time. Data for more products is coming soon.
The product tables capture product data in a star schema, meaning some tables refer to other tables. Because of this, you may need to join multiple queries to get the data you need in Atlassian’s analytics offering.
These tables rapidly reflect changes in your products, typically within half an hour. This makes them especially useful for custom analysis, or when having the most up-to-date information is important.
All tables have row_refreshed_at information that indicates data freshness. It captures the last data refresh time. For example, a timestamp of 2020-12-30T23:00:41.950817Z means that the information in the row is current as of 11 p.m. on December 30, 2020, in the UTC time zone.
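If you export rows and want to check freshness outside the product, a row_refreshed_at value can be read as a standard ISO 8601 timestamp. The following is a minimal sketch in Python, assuming the value arrives as a string in the format shown above. The helper names parse_refreshed_at and age_in_minutes are hypothetical and are not part of any Atlassian API.

```python
from datetime import datetime, timezone


def parse_refreshed_at(value: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp such as a row_refreshed_at value."""
    # Older Python versions do not accept a trailing "Z" in fromisoformat,
    # so swap it for an explicit UTC offset before parsing.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def age_in_minutes(value: str, now: datetime) -> float:
    """Return how many minutes old the row was at the moment `now`."""
    return (now - parse_refreshed_at(value)).total_seconds() / 60


refreshed = parse_refreshed_at("2020-12-30T23:00:41.950817Z")
print(refreshed.isoformat())  # 2020-12-30T23:00:41.950817+00:00

# Compare against the current time to see how stale a row is.
print(age_in_minutes("2020-12-30T23:00:41.950817Z", datetime.now(timezone.utc)))
```

Passing the comparison time in explicitly keeps the helper easy to test and avoids mixing time zone-aware and naive values.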
Dates and timestamps in the Data Lake are in UTC
Keep in mind that all columns with dates and timestamps (for example, created_at and updated_at) are in the UTC time zone. To convert these to a different time zone, change either your workspace time zone in your workspace settings or the individual dashboard’s time zone in its dashboard settings.
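To illustrate what such a conversion does to a stored value, here is a small Python sketch. It is not how the workspace or dashboard setting is implemented; it only shows the same UTC instant expressed in another zone. The target zone (a fixed UTC-05:00 offset) and the helper name to_target_zone are assumptions chosen for the example.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical target zone: a fixed UTC-05:00 offset, for illustration only.
# A fixed offset ignores daylight saving time.
TARGET_ZONE = timezone(timedelta(hours=-5))


def to_target_zone(utc_value: str, zone: timezone = TARGET_ZONE) -> datetime:
    """Convert a UTC timestamp string to the same instant in another zone."""
    parsed = datetime.fromisoformat(utc_value.replace("Z", "+00:00"))
    return parsed.astimezone(zone)


local = to_target_zone("2020-12-30T23:00:41.950817Z")
print(local.isoformat())  # 2020-12-30T18:00:41.950817-05:00
```

The instant itself does not change, only its representation, which is why date-based groupings can shift when a dashboard uses a different time zone.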
Workspace data
All tables in the schema have a workspace_id column. If you have multiple instances of Jira Cloud in your organization, you can use this column to distinguish between them.
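The idea of joining on workspace_id to tell instances apart can be sketched with an in-memory SQLite database in Python. SQLite stands in for the Data Lake here, and the sample rows are made up. The table names workspace and account and the workspace_id column come from this page; the other column names (name, account_id) and the short IDs are assumptions for illustration, and real workspace_id values are 36-character identifiers.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical, simplified versions of the workspace and account tables.
conn.executescript(
    """
    CREATE TABLE workspace (workspace_id TEXT PRIMARY KEY, name TEXT);
    CREATE TABLE account (account_id TEXT, workspace_id TEXT, name TEXT);

    INSERT INTO workspace VALUES
        ('ws-1', 'Engineering site'),
        ('ws-2', 'Support site');
    INSERT INTO account VALUES
        ('acc-1', 'ws-1', 'Alice'),
        ('acc-2', 'ws-2', 'Bob'),
        ('acc-3', 'ws-1', 'Carol');
    """
)

# Join each account row to its workspace, then count accounts per instance.
rows = conn.execute(
    """
    SELECT w.name, COUNT(*) AS account_count
    FROM account AS a
    JOIN workspace AS w ON w.workspace_id = a.workspace_id
    GROUP BY w.workspace_id, w.name
    ORDER BY w.name
    """
).fetchall()

print(rows)  # [('Engineering site', 2), ('Support site', 1)]
```

Joining on workspace_id rather than on a display name keeps results correct even if two instances share a name.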
SQL name for the table: workspace
Lists your Jira instances
Table uses the following columns:
Column | Data type | Example data | Description |
---|---|---|---|
workspace_id | String | | 36-character unique site identifier. This is the primary key for workspace data. |
| String | tbd | The name of the Jira instance |
row_refreshed_at | Datetime | | Last refresh time for this record (UTC time zone) |
Account data
SQL name for the table: account
Lists the Atlassian accounts within your organization
Table uses the following columns:
Column | Data type | Example data | Description |
---|---|---|---|
| String | TBD | Atlassian account ID. This is the primary key for account data. |
| String | | The name of the account holder |
| String | | The email address of the account holder |