Module markov.api.data.catalog_handler
Classes
class DataCatalogHandler (for_profile='default')
-
Static methods
def get_data_families() ‑> List[DataFamily]
def get_data_family_by_id(df_id: str) ‑> DataFamily
def get_data_family_by_name(df_name: str)
def get_ds_info_for_df(df_id: str, start: int = 0, end: int = -1) ‑> List[DSNameIdTuple]
-
Given the data_family_id, return the list of all the datasets that belong to this family
Args
df_id:data_set family id for which we need to find all the datasets start:start of the batch, sorted by time of registration end:last one in the batch, sorted by time of registration
Returns
List[DataSet]
def get_ds_with_prefix(prefix='', df_id: str = None) ‑> List[DataSet]
-
Get the data set with the name matching prefix.Provide the data_family_id if you want to restrict to a specific data family
Args
prefix
:str
- Name
df_id
:str
- data family id
Returns
List of datasets that match the constraint specified
def get_not_analyzed_datasets() ‑> List[DataSet]
-
Fetch all the datasets from the data_set catalog that are not analyzed Returns:
def trigger_analysis_run(da_requests: DataAnalysisRunRequest) ‑> DataAnalysisRunResponse
def update_datafamily(ds_id: str, df_id: str)
-
Update DataFamily of the Dataset given by ds_id
Args
ds_id
:str
- dataset id of the dataset we want to update the datafamily of
df_id
:str
- datafamily if to be updated to
Returns:
def update_ds_paths(ds_id: str, ds_paths: List[DataSegmentPath]) ‑> DataSet