You need data

You have a problem that data science can solve, and you need data to train an algorithm. The marketplace provides you with:

The most comprehensive public and private datasets, searchable by characteristics, geography and industry segment.

Free and paid offerings. Some datasets are raw, while others have been extensively cleaned and meta-tagged specifically for data science. If you have a customer request, the community has extensive resources to support your efforts.

Access to the datasets you require while automating the financial, legal and technical requirements for delivery.

Here’s how it works

You can simply pay for access to data, or you can use the marketplace's more sophisticated creation/ownership model based on shared revenue.

Many data providers understand the potential value of their data and would like to participate in the revenue stream generated by its use. The marketplace tracks utilization and provides auditability for all providers of value.

A data pipeline can have many Vaders (value adders), who contribute datasets, data transformation, validation, meta-tagging and so on. The marketplace automates the process of compensating each contributor based on the revenue generated from the model. This allows unfettered creativity without the onus of upfront expenditure, sharing the financial benefit across the value chain.
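To make the shared-revenue idea concrete, here is a minimal sketch of a proportional split. It assumes each Vader has agreed a relative share up front; the function name `split_revenue`, the contributor names and the share values are all hypothetical and do not describe the marketplace's actual payout logic.

```python
from decimal import Decimal, ROUND_DOWN


def split_revenue(revenue, shares):
    """Split revenue among contributors in proportion to their agreed shares.

    revenue: Decimal amount earned by the model (hypothetical input).
    shares:  dict mapping contributor name to a positive relative weight.
    """
    total = sum(shares.values())
    if total <= 0:
        raise ValueError("shares must sum to a positive number")

    cent = Decimal("0.01")
    # Round each payout down to the cent so we never pay out more than we earned.
    payouts = {
        name: (revenue * share / total).quantize(cent, rounding=ROUND_DOWN)
        for name, share in shares.items()
    }

    # Assumption: any rounding remainder goes to the largest shareholder,
    # so that the payouts always add up to the full revenue.
    remainder = revenue - sum(payouts.values())
    largest = max(shares, key=shares.get)
    payouts[largest] += remainder
    return payouts


if __name__ == "__main__":
    example = split_revenue(
        Decimal("100.00"),
        {"dataset_provider": 50, "cleaning": 30, "meta_tagging": 20},
    )
    for name, amount in example.items():
        print(name, amount)
```

Using `Decimal` rather than floats keeps the amounts exact to the cent, which matters when payouts need to be audited.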

You have data

It’s valuable, and you want it monetized and protected without losing control. The marketplace is the enabling platform. Every day, thousands of people who are willing to pay come looking for data to train their models.

There are two types of data in the marketplace:

Standard

Standard data is fully accessible and viewable to any registered user.

Privacy protected

Privacy-protected data requires the algorithm to train at the provider's premises, returning only the statistical models. This ensures that the private data itself is never exposed.

We support multiple Privacy Enhancing Technologies (PETs).
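The "train where the data lives" pattern described above can be sketched as follows. This is an illustration only: the function `train_on_premises`, the toy least-squares model and the shape of the returned dictionary are assumptions. It is not one of the supported PETs, and it does not show how the marketplace actually dispatches training jobs.

```python
def train_on_premises(private_rows):
    """Runs at the provider's site (hypothetical entry point).

    Fits y = slope * x + intercept by ordinary least squares and returns
    only the fitted parameters. The raw rows are never part of the result.
    """
    n = len(private_rows)
    if n < 2:
        raise ValueError("need at least two rows to fit a line")

    mean_x = sum(x for x, _ in private_rows) / n
    mean_y = sum(y for _, y in private_rows) / n

    sxx = sum((x - mean_x) ** 2 for x, _ in private_rows)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in private_rows)

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    # Only aggregate model parameters leave the provider's premises.
    return {"slope": slope, "intercept": intercept, "n_samples": n}


if __name__ == "__main__":
    # Toy private data that stays with the provider.
    rows = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
    print(train_on_premises(rows))
```

The data consumer receives only the returned dictionary. In practice, model parameters can still reveal information about the training data, which is why additional Privacy Enhancing Technologies are layered on top of this basic pattern.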

Here’s how it works

The steps to make your data available on the marketplace:

1 Locate your data

In situ – the data remains where it is, and access is provided via the marketplace's API.
On a third-party system – the major cloud platforms, such as AWS, Google Cloud and Azure, can host datasets and securely manage access to them via standard APIs.
On the marketplace – you can host your data on our secure high-speed infrastructure.

2 Describe your data

There are a variety of standards and tools that allow you to describe the characteristics of your dataset.
Have your data validated – we encourage you to work with a Vader group, either one of your own making or an existing team that you can find listed here.
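As a rough illustration of what describing a dataset might look like, the sketch below defines a small descriptor and a filter over the three search dimensions mentioned earlier (characteristics, geography and industry segment). The class `DatasetDescriptor`, its field names and the `matches` helper are hypothetical. They do not follow any particular metadata standard or the marketplace's real schema.

```python
import json
from dataclasses import dataclass, asdict, field


@dataclass
class DatasetDescriptor:
    """Hypothetical description of a dataset listing."""
    name: str
    geography: str
    industry_segment: str
    characteristics: list = field(default_factory=list)
    privacy_protected: bool = False  # False corresponds to "standard" data

    def to_json(self):
        # Serialize with sorted keys so the output is stable and easy to diff.
        return json.dumps(asdict(self), sort_keys=True)


def matches(descriptor, geography=None, industry_segment=None, characteristic=None):
    """Return True if the descriptor satisfies every filter that was supplied."""
    if geography is not None and descriptor.geography != geography:
        return False
    if industry_segment is not None and descriptor.industry_segment != industry_segment:
        return False
    if characteristic is not None and characteristic not in descriptor.characteristics:
        return False
    return True


if __name__ == "__main__":
    listing = DatasetDescriptor(
        name="retail-footfall",
        geography="EU",
        industry_segment="retail",
        characteristics=["time-series", "hourly"],
    )
    print(listing.to_json())
    print(matches(listing, geography="EU", characteristic="hourly"))
```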

3 Prepare your data

Using the tools available in the market, from selected third-party vendors or from approved service partners, you can prepare your dataset for listing in the market. The revenue model that you have created is also defined and implemented at this stage.


A bid is created and registered with the system.