Datasets (cachai.data)

Documentation /Datasets

The cachai.data module allows you to access pre-created datasets hosted in the cachai-datasets GitHub. Downloading datasets requires an internet connection, if the files are already cached (i.e., you’ve accessed them before), cachai will use the local copies, allowing offline work.

Contents

load_dataset(…)

Load a dataset from GitHub with a persistent cache system.

get_dataset_repo()

Return the URL of the cachai datasets repository.

get_dataset_names()

Retrieve the list of available dataset names.

get_dataset_metadata(…)

Print the metadata of a specific dataset.

clear_cache(…)

Delete old cached files from cachai's cache directory.