Uploading a Dataset

Contributing data to Nuklai’s public data marketplace is an important part of Nuklai’s data ecosystem. It opens your profile up to new business collaborations and to revenue from other Nuklai members who subscribe to your datasets.

In this user guide we will show you how to publish your first dataset. For visual learners, we have prepared this video tutorial.

When you arrive on the Nuklai marketplace, you will see an Upload Dataset button. When you click this button, you will need to choose between two options:

  • Dataset: choose this option if you want to upload a dataset with you as the sole contributor.

  • Community Dataset: choose this option if you want to upload a dataset where contributions are open to any user.

Once you have chosen your dataset type, you will be asked to enter a title for the dataset. You can still change the title later. You will then be asked to provide a file of up to 2GB in size. If you have larger data files that you would like to publish, please contact hello@nukl.ai and we will assist you in getting your data published.

Supported file formats:

  • CSV

  • JSON

  • XML

  • Parquet
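Before uploading, you may want to check a file against the limits above. The following is a minimal sketch, not part of the Nuklai platform: the function names are hypothetical, the format is guessed from the file extension, and "2GB" is assumed to mean 2 * 1024**3 bytes.

```python
from pathlib import Path

# Formats listed in this guide; matching on the file extension is an assumption.
SUPPORTED_EXTENSIONS = {".csv", ".json", ".xml", ".parquet"}

# Assumption: "2GB" is read as 2 * 1024**3 bytes; the platform may count differently.
MAX_UPLOAD_BYTES = 2 * 1024**3


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 2 GB; contact hello@nukl.ai")
    return problems


def check_local_file(path: str) -> list[str]:
    """Convenience wrapper that reads the file size from disk."""
    file_path = Path(path)
    return check_upload(file_path.name, file_path.stat().st_size)
```

For example, `check_local_file("sales.csv")` returns an empty list when the file has a supported extension and fits within the assumed size limit.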

When you click the upload button, the file will be processed. This can take several minutes. You can view your dataset on the My Datasets page. When the status changes from “processing” to “draft”, your dataset has been processed successfully. We recommend that you now query your dataset via the user interface by clicking the query button (rocket icon) to check that all data has been processed the way you intended.

Once your dataset is in draft, you have 14 days to finalize its publication. In this phase you are still able to delete the dataset. After publishing, you will no longer be able to delete the dataset yourself. If you do wish to remove it from the Nuklai data marketplace, please contact hello@nukl.ai.
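If you manage several drafts, it can help to keep track of the 14-day window yourself. The sketch below is illustrative only: the function names are hypothetical, and it assumes the window starts at the moment the status changes to “draft”, which this guide does not state explicitly.

```python
from datetime import datetime, timedelta, timezone

DRAFT_WINDOW = timedelta(days=14)  # the 14 days mentioned in this guide


def publish_deadline(draft_since: datetime) -> datetime:
    """Latest moment to publish, assuming the window starts at `draft_since`."""
    return draft_since + DRAFT_WINDOW


def days_left(draft_since: datetime, now: datetime) -> int:
    """Whole days remaining in the draft window (0 once it has passed)."""
    remaining = publish_deadline(draft_since) - now
    return max(remaining.days, 0)


# Example with a made-up draft date.
draft_since = datetime(2024, 3, 1, 12, 0, tzinfo=timezone.utc)
print(publish_deadline(draft_since).isoformat())
```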

When you click on the edit button (pencil icon), you will have several options:

  • Edit Dataset Information: here you can add a description of your dataset, update the title and provide additional information about your dataset.

  • Edit Metadata: here you can enrich the metadata of your dataset, by adding tags and descriptions to the metadata.

To continue publishing your dataset, you will need to add some information to it. You do this by clicking the edit button underneath the dataset information card.

  • Dataset name (mandatory): title of the dataset that will be displayed on the Nuklai data marketplace

  • Dataset sample (optional): choose whether you want users to see a sample of your dataset. A sample consists of 5 records from your dataset.
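Before enabling the sample, you may want to look at what five records of your data reveal, for example to spot sensitive fields. The sketch below previews the first five records of a CSV file locally. Which five records the platform actually shows is not stated in this guide, so taking the first five is an assumption, and `preview_sample` is a hypothetical helper.

```python
import csv
import io
from itertools import islice

SAMPLE_SIZE = 5  # a sample consists of 5 records, per this guide


def preview_sample(csv_text: str, size: int = SAMPLE_SIZE) -> list[dict[str, str]]:
    """Return the first `size` records of a CSV document as dictionaries."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return list(islice(reader, size))


# Example with made-up data: eight records, of which five are previewed.
example = "id,city\n" + "".join(f"{i},city{i}\n" for i in range(8))
for record in preview_sample(example):
    print(record)
```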

The last step is to choose a license under which you want to publish your dataset. If you have a commercial license for your dataset, you can choose to create a Custom License. You only have to create this license once; it will remain available to you for future use.

  • Abbreviation: create a simple two- to four-letter abbreviation of your license.

  • Full Licence Name: write out the full name of your license.
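The abbreviation rule above can be checked with a short pattern. This is a sketch under the assumption that “letter” means A to Z in either case; the platform’s own validation may differ, and `is_valid_abbreviation` is a hypothetical name.

```python
import re

# Assumption: "two to four letters" means 2-4 ASCII letters, upper or lower case.
ABBREVIATION_PATTERN = re.compile(r"[A-Za-z]{2,4}")


def is_valid_abbreviation(abbreviation: str) -> bool:
    """True if the whole string is two to four letters."""
    return ABBREVIATION_PATTERN.fullmatch(abbreviation) is not None


print(is_valid_abbreviation("ACL"))    # True
print(is_valid_abbreviation("ABCDE"))  # False, five letters
```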

To continue, click the save button. Congratulations, you are now one step closer to publishing your dataset.

Before publishing, we recommend that you enrich the metadata of your dataset to provide more context about the data it contains. Please follow the metadata user guide to do so. In that guide, you will also learn how to delete certain fields from your dataset before publishing. You may want to delete fields that contain sensitive information.

To finalize publishing, click the Publish button on the “Pricing & Publishing” card. This publishes the dataset to the Nuklai data marketplace.

You will now have to choose the base price of your dataset. The base price is the amount of $USDC.e you ask for one day of subscription. If a user subscribes for 30 days, the price of the subscription will be 30 x the base price.
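The pricing rule is a simple multiplication, sketched below. The function name and the example base price are illustrative only; `Decimal` is used so that currency amounts are not affected by floating-point rounding.

```python
from decimal import Decimal


def subscription_price(base_price_per_day: Decimal, days: int) -> Decimal:
    """Price of a subscription: number of days times the daily base price."""
    if days < 1:
        raise ValueError("days must be at least 1")
    return base_price_per_day * days


# Example with a made-up base price of 0.50 $USDC.e per day.
print(subscription_price(Decimal("0.50"), 30))  # 15.00
```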

In the overview on the right, you will see an example calculation showing exactly how much you earn from a subscription.

Since Nuklai is a collaborative marketplace, you can decide exactly how the revenue is distributed amongst contributors to your dataset.

  • Platform Fee: The fee Nuklai charges for the usage of the platform. If you upload the data to the Nuklai data environment, you will be charged a 35% fee. If you host the data yourself (coming soon), Nuklai will charge you a 15% fee.

  • Management Fee (Community Datasets only): The fee you will receive for the management of the community dataset, such as updating the information of the dataset and promoting the dataset. The maximum fee that can be set is 50%.

  • Data revenue share: The share that is split amongst the providers of the data (divided among several contributors in the case of a community dataset).

  • Metadata revenue share (optional): The share contributors earn for enriching the metadata (divided among several users when more than one contributed metadata).
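To get a feel for how such a split works out, here is a rough sketch. It assumes that every fee and share is a percentage of the full subscription price and that together they add up to 100%. This guide does not spell out the exact formula, so treat the overview shown on the platform as authoritative. Only the 35% platform fee comes from this guide; the other percentages and the function name are made up for illustration.

```python
from decimal import Decimal


def split_revenue(price: Decimal, shares_percent: dict[str, Decimal]) -> dict[str, Decimal]:
    """Split `price` by percentage.

    Assumption: each share is a percentage of the full price and all shares
    together add up to exactly 100%.
    """
    if sum(shares_percent.values()) != Decimal("100"):
        raise ValueError("shares must add up to 100%")
    return {name: price * pct / Decimal("100") for name, pct in shares_percent.items()}


# Illustrative split for a 15 $USDC.e subscription on a non-community dataset.
shares = {
    "platform_fee": Decimal("35"),    # from this guide (data hosted by Nuklai)
    "data_share": Decimal("55"),      # made-up value
    "metadata_share": Decimal("10"),  # made-up value
}
for name, amount in split_revenue(Decimal("15"), shares).items():
    print(name, amount)
```

For a community dataset, a management fee entry (at most 50%) would be added to the dictionary under the same assumption.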

When you have filled in all the fields, click the save button. MetaMask will now pop up and ask you to sign two transactions. Please do not refresh the page.

When you sign the transactions, a dataset NFT is minted. You will need some AVAX in your wallet to pay the gas fees for minting the dataset NFT. If you want to transfer ownership of the dataset, send your dataset NFT to another wallet. The ownership of the dataset will then be updated automatically on the platform to the new wallet.

Congratulations, you have now published your dataset. You can verify this by going to the My Datasets page and checking that the status of the dataset is “published”.
