cachai.data.load_dataset
- cachai.data.load_dataset(name='', redownload=False)[source]
Load a dataset from GitHub with a persistent cache system.
- Parameters
- name
str Name of the dataset to load.
- redownload
bool, optional Whether to force re-downloading the dataset, ignoring the cache (default:
False).
- name
- Returns
pandas.DataFrameDataFrame containing the dataset.
Examples
import cachai.data as chd df = chd.load_dataset('lithium') print(df.head())
CNAME [Fe/H] A(Li) ... e Z$_{max}$ $L_z$ 0 00000302-6002570 -0.31 2.01 ... 0.119362 1.003228 2028.535804 1 00001749-5449565 -0.17 1.98 ... 0.110635 0.670432 1907.144965 2 00012216-5458205 -0.07 1.51 ... 0.276396 0.996552 1836.529851 3 00040666-3709129 0.28 0.62 ... 0.112774 0.501939 1676.768299 4 00042981-4701022 -0.34 1.71 ... 0.257109 1.327526 2239.902280 [5 rows x 9 columns] ...