🔍 Open Dataset Finder

This app lets you search datasets from multiple open data sources.

  • Hugging Face Datasets: Public machine learning datasets for NLP, computer vision, speech, and more.
  • Zenodo: Research datasets shared by scientists and institutions, often linked to academic publications.
  • Kaggle: Community datasets, competition datasets, and practice datasets shared on Kaggle.

Kaggle authentication

To enable Kaggle search, you need to add your Kaggle API credentials as Repository secrets in the Space settings:

  • KAGGLE_USERNAME: your Kaggle username
  • KAGGLE_KEY: your Kaggle API token (found in the kaggle.json file you can download from your Kaggle account)

Once the secrets are set, you can check the Kaggle box in the UI and search Kaggle datasets directly here.

Source repository

10 200