Cacher
- class sdgx.cachers.base.Cacher(blocksize, *args, **kwargs)
Bases: object
Base class for cachers.
Cacher caches raw and processed data so that it does not have to be read or processed again.
You can treat Cacher as a wrapper around DataConnector.
- clear_invalid_cache()
Clear invalid cache.
This is useful when the data source has changed. A subclass can inspect its cache and clear only the invalid entries, or it may clear the entire cache when it cannot tell which entries are invalid or does not support selective clearing.
- iter(chunksize: int, data_connector: DataConnector) -> Generator[DataFrame, None, None]
Load data from data_connector or from the cache, one chunk at a time.
- load(offset: int, chunksize: int, data_connector: DataConnector) -> DataFrame
Load data from data_connector or from the cache.
- load_all(data_connector: DataConnector) -> DataFrame
Load all data from data_connector or from the cache.
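The interface above can be illustrated with a minimal in-memory sketch. It is not the sdgx implementation: `ListConnector` and `MemoryCacher` are hypothetical names, plain Python lists stand in for DataFrames, and the connector's `read(offset, limit)` signature is an assumption made for this example only.

```python
class ListConnector:
    """Hypothetical stand-in for a DataConnector; counts how often it is read."""

    def __init__(self, rows):
        self.rows = list(rows)
        self.read_calls = 0

    def read(self, offset=0, limit=None):
        # Assumed signature: return up to `limit` rows starting at `offset`.
        self.read_calls += 1
        end = None if limit is None else offset + limit
        return self.rows[offset:end]


class MemoryCacher:
    """Toy cacher: remembers the result of every (offset, chunksize) request."""

    def __init__(self, blocksize, *args, **kwargs):
        self.blocksize = blocksize
        self._cache = {}

    def load(self, offset, chunksize, data_connector):
        key = (offset, chunksize)
        if key not in self._cache:
            # Cache miss: fall back to the connector and remember the result.
            self._cache[key] = data_connector.read(offset=offset, limit=chunksize)
        return self._cache[key]

    def iter(self, chunksize, data_connector):
        offset = 0
        while True:
            chunk = self.load(offset, chunksize, data_connector)
            if not chunk:
                break  # an empty chunk means the source is exhausted
            yield chunk
            offset += len(chunk)

    def load_all(self, data_connector):
        rows = []
        for chunk in self.iter(self.blocksize, data_connector):
            rows.extend(chunk)
        return rows

    def clear_invalid_cache(self):
        # This toy cannot tell which entries are stale, so it drops everything.
        self._cache.clear()


connector = ListConnector(range(10))
cacher = MemoryCacher(blocksize=4)
chunks = list(cacher.iter(4, connector))   # first pass reads from the connector
everything = cacher.load_all(connector)    # second pass is served from the cache
```

Keying the cache on `(offset, chunksize)` keeps the sketch short; a real cacher would more likely store fixed-size blocks of `blocksize` rows and assemble requests from them.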
NoCache
- class sdgx.cachers.base.NoCache(blocksize, *args, **kwargs)
Bases: Cacher
NoCache does not cache anything; it simply proxies every call to data_connector.
- clear_cache()
Clear all cache.
- clear_invalid_cache()
Clear invalid cache.
This is useful when the data source has changed. A subclass can inspect its cache and clear only the invalid entries, or it may clear the entire cache when it cannot tell which entries are invalid or does not support selective clearing.
- iter(chunksize: int, data_connector: DataConnector) -> Generator[DataFrame, None, None]
Proxy to data_connector.iter.
- load(offset: int, chunksize: int, data_connector: DataConnector) -> DataFrame
Proxy to data_connector.read.
- load_all(data_connector: DataConnector) -> DataFrame
Proxy to data_connector.read.
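The proxy behaviour described above can be sketched as follows. This is an illustration rather than the sdgx source: `ListConnector` and `PassThroughCacher` are hypothetical names, lists stand in for DataFrames, and the connector's `read(offset, limit)` and `iter(chunksize)` signatures are assumptions.

```python
class ListConnector:
    """Hypothetical stand-in for a DataConnector; counts how often it is read."""

    def __init__(self, rows):
        self.rows = list(rows)
        self.read_calls = 0

    def read(self, offset=0, limit=None):
        # Assumed signature: return up to `limit` rows starting at `offset`.
        self.read_calls += 1
        end = None if limit is None else offset + limit
        return self.rows[offset:end]

    def iter(self, chunksize):
        # Assumed signature: yield consecutive chunks of `chunksize` rows.
        for start in range(0, len(self.rows), chunksize):
            yield self.rows[start:start + chunksize]


class PassThroughCacher:
    """Toy equivalent of a no-op cacher: every call goes straight to the connector."""

    def __init__(self, blocksize, *args, **kwargs):
        self.blocksize = blocksize  # accepted for interface compatibility, unused

    def load(self, offset, chunksize, data_connector):
        return data_connector.read(offset=offset, limit=chunksize)

    def iter(self, chunksize, data_connector):
        return data_connector.iter(chunksize=chunksize)

    def load_all(self, data_connector):
        return data_connector.read()

    def clear_cache(self):
        pass  # nothing is stored, so there is nothing to clear

    def clear_invalid_cache(self):
        pass  # likewise a no-op


connector = ListConnector(["a", "b", "c", "d", "e"])
proxy = PassThroughCacher(blocksize=2)
window = proxy.load(1, 2, connector)
same_window = proxy.load(1, 2, connector)  # hits the connector again, no caching
```

Because nothing is stored, repeated requests cost a fresh read each time; that trade-off suits sources that are already cheap to read or that change too often for cached data to stay valid.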