cachai.data.load_dataset

cachai.data.load_dataset(name='', redownload=False)

Load a dataset from GitHub, using a persistent cache so that repeated calls can reuse a previously downloaded copy.

Parameters
name : str

Name of the dataset to load.

redownload : bool, optional

Whether to force re-downloading the dataset, ignoring the cache (default: False).

Returns
pandas.DataFrame

DataFrame containing the dataset.

Examples

import cachai.data as chd

df = chd.load_dataset('lithium')
print(df.head())
              CNAME  [Fe/H]  A(Li)  ...         e  Z$_{max}$        $L_z$
0  00000302-6002570   -0.31   2.01  ...  0.119362   1.003228  2028.535804
1  00001749-5449565   -0.17   1.98  ...  0.110635   0.670432  1907.144965
2  00012216-5458205   -0.07   1.51  ...  0.276396   0.996552  1836.529851
3  00040666-3709129    0.28   0.62  ...  0.112774   0.501939  1676.768299
4  00042981-4701022   -0.34   1.71  ...  0.257109   1.327526  2239.902280

[5 rows x 9 columns]
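
The caching behaviour described above (reuse a stored copy unless `redownload` is set) can be illustrated with a small standalone sketch. This is not cachai's actual implementation: the helper `load_cached`, the `fetch` callback, the `cache_dir` argument, and the `.csv` file naming are all hypothetical and chosen only for illustration.

```python
import tempfile
from pathlib import Path
from typing import Callable


def load_cached(name: str,
                fetch: Callable[[str], bytes],
                cache_dir: Path,
                redownload: bool = False) -> bytes:
    """Return the raw bytes for `name`, calling `fetch` only when needed.

    Hypothetical helper: cachai's real cache layout and download logic
    are not documented here and may differ.
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    cached_file = cache_dir / f"{name}.csv"  # assumed naming scheme
    # Download when forced, or when no cached copy exists yet.
    if redownload or not cached_file.exists():
        cached_file.write_bytes(fetch(name))
    return cached_file.read_bytes()


# Demonstration with a fake downloader that records each call.
calls = []


def fake_fetch(name: str) -> bytes:
    calls.append(name)
    return b"x,y\n1,2\n"


demo_dir = Path(tempfile.mkdtemp())
first = load_cached("demo", fake_fetch, demo_dir)    # downloads
second = load_cached("demo", fake_fetch, demo_dir)   # served from cache
n_after_cache_hit = len(calls)
third = load_cached("demo", fake_fetch, demo_dir, redownload=True)  # forced
n_after_redownload = len(calls)
```

In this sketch the second call never reaches the downloader, while the third one does because `redownload=True` bypasses the cached file. With cachai itself, the corresponding call would be `chd.load_dataset('lithium', redownload=True)`.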
...