clintk.text_parser.section_manager module¶
Module to manage sections found by parser
-
clintk.text_parser.section_manager.main_splitter(df, columns)[source]¶ splits all the entries of df
Using main_splitter causes to split texts into several rows, one text is split into the number of sections it contains
Parameters: - df (pd.DataFrame)
- columns (list of str)
Returns: Return type: pd.DataFrame
-
clintk.text_parser.section_manager.reduce_dic(dico, sections)[source]¶ merges key, values of a dictionary
@TODO find sections names using regex
Parameters: - dico (dict)
- sections (list of str) – name of the sections to keep as in ReportsParser.sections
Returns: concatenated contents of sections
Return type: