Skip to main content

Creating Datasets

Creating a Dataset Repository

To create a new dataset repository, Login to OpenCSG Community, click on New Dataset in the top right corner.

Create dataset repo

In the dataset repository creation page, fill in the following information, and then click the Create Dataset.

  • Specify the owner of the repository: this can be either you or any of the organizations you are affiliated with.
  • Enter your dataset name, alias and description.
  • Specify the license.
  • Creating public datasets is currently not supported. Please contact the administrator contact@opencsg.com for manual review if needed.

Init Page

After creating your dataset repository, you should see a page like this:

Dataset repo file

The README.md file that the system generates for you can be accessed and edited online on the Files tab. Once you are finished, the README.md file will be automatically rendered as dataset card by the system and shown on the Summary tab.

Dataset card can help users better understand your dataset and make your dataset easier to retrieve and discover. We recommend that you create your dataset card according to the dataset card specification. See Dataset Card Specification for more details.

Dataset List

Click on Profile in the top right corner, you will see the list of repositories you have created.

Dataset repo list