Skip to content

Dataset Management

The dataset list has two styles: the default card style and the list style.

Dataset Display

Card Style

List Style

Click the list button to switch to the list style. In the list style, Access Popularity and Last Access Time are also provided to help users understand the access status of resources and to clean up resources that have not been accessed for a long time.

Sorting

The dataset list is sorted by modification time in descending order by default.

In the list style, you can click the table header columns to sort, such as sorting by Access Popularity in the image below:

Enter characters in the search box to search for datasets.

Dataset Operations

Click the three-dot menu of the dataset to see common operations, including delete, rename, create a copy, replace dataset, hide, and show.

Create New Dataset

Create Blank Dataset

Click the New Dataset button in the upper right corner to create a dataset. See Create Dataset for details.

Import Dataset

Click the New Dataset button in the upper right corner, and select Import Dataset from the dropdown menu. You can import datasets from other applications or data packages.

Tip

  1. Only local files, data connections, and SQL query datasets support this function, so not all datasets in the application may be listed for import.
  2. After importing the dataset, the dataset indicators and related data models will not be imported into the new application.
  3. When the data permission of the application is set to Consumer, datasets with the acceleration engine enabled cannot be imported.
  4. You can import one or multiple datasets from the application.

Delete

When deleting a dataset, if the dataset is referenced by a model or chart, it cannot be deleted, only unreferenced datasets can be deleted.

Rename

You can rename the dataset to better align with business logic.

Create Copy

Create a copy of the dataset, generating a new dataset.

Copy To

Copy To allows you to copy the dataset to other applications or data packages, enabling cross-application reuse of datasets.

Tip

  1. Only local files, data connections, and SQL query datasets support this function, other datasets do not support it.
  2. When copying the dataset to another application, the dataset indicators and related data models will not be copied.
  3. When copying a dataset with the engine enabled to another application, the data permission of the application must be set to the application author or dataset author mode.

Replace Dataset

Replace the dataset. See Replace Dataset for details.

Hide

After processing the dataset, some intermediate datasets may no longer be needed for charting, so you can hide these datasets. Hidden datasets can still participate in model associations or dataset processing, but they will not be visible on the dashboard and chart pages.

After hiding the dataset, the hide function in the three-dot menu changes to show, and you can click show to set the dataset to visible.

Set as Default Dataset

The system supports setting a dataset as the default dataset. During chart creation, the default dataset in the data package will be displayed, reducing the need to switch and search for datasets.

Dataset Management

An important task beyond data exploration is data preprocessing. In the system, the dataset management page is where this function is implemented. Click on a dataset to open it. Here is a two-dimensional table structure showing the data of the dataset, with the following operations in the header.

Field Selection

Click the Field Selection icon to pop up the field selection list, select the fields of interest, and then click Apply. The dataset page will display the data of the selected fields.

Field selection is temporary, and refreshing the page will clear the selected items.

Filter Data

Click the Filter Data icon to pop up the data filtering window, add filtering conditions in the window, and then click OK to apply. Data filtering is also temporary, and refreshing the page will clear the selected items.

Data Management

In data management, you can perform operations such as appending datasets, appending local files, editing datasets, and exporting data.

Append Dataset

For datasets imported into the engine, there is an Append Dataset menu. See Dataset Append for details.

Append Local File

Only local file datasets have the Append Local File menu. See Append Local File for details.

Edit Dataset

For SQL query datasets, multi-table joint datasets, data aggregation datasets, and merged datasets, there is an Edit Dataset menu.

When editing the dataset, there is a reference check, and fields that are referenced elsewhere cannot be removed and must be retained.

Export Data

When the application is in creation mode, all dataset data can be exported.

Data Information

Data information displays the metadata, including:

  • Dataset Name
  • Dataset Type: Text indicates the source of the dataset: data connection, local file, SQL query, etc., and the icon indicates the storage type of the dataset
  • Data Connection: The data connection used by the dataset
  • Rows/Columns/Size: Displays the space occupied by the dataset in the system
  • Original Table: The original table name of the dataset for data connection datasets
  • Engine Table: The table name of the dataset in the engine for engine connection datasets
  • Enable Acceleration Engine: Turn on/off the acceleration engine
  • Public Dictionary: Enable the public dictionary, allowing modeling between different data sources
  • Inherit Upstream Permissions: When enabled, for transformation datasets (multi-table joint, data aggregation, data merging, row-to-column, column-to-row), the permissions of the upstream dataset can be inherited. If the current dataset or its downstream datasets have been imported into the engine, the inherit upstream permissions function will be disabled
  • Update Time: The time when the dataset content was last updated
  • Data Update:
    • Update Now: For datasets imported into the engine, an update task will be initiated to re-examine the data from the data source and generate the information needed for data exploration. For direct connection datasets, updating now will update the metadata information
    • Update Schedule: Set an update schedule. See Update Schedule for details
  • Update Status: Refers to the status of the most recent data update operation, with states such as update completed, update failed, waiting for update, and updating

Field Management

Field management mainly involves operations on fields, such as field grouping, creating new fields, and modifying field types. See Field Management for detailed instructions.

Metric Management

Metric management involves operations on dataset metrics, including creating new metrics and grouping metrics. See Metric Management for detailed instructions.

HENGSHI SENSE Platform User Manual