process_BindingDB¶
process_BindingDB processes BindingDB dataset.
dataset.process_BindingDB(path = None, df = None, y = 'Kd', binary = False, convert_to_log = True, threshold = 30)
- path (str, a directory) - the path that save BindingDB dataset file. Example: “./data/BindingDB_All.tsv”.
- df (pandas.DataFrame) - Dataframe that contains input data, if first parameter “path” is None, use the “df”.
- y (str; can be “Kd”, “Ki”, “IC50” or “EC50”) - specify the binding score.
- binary (bool) - If binary is True, formulate prediction task as a binary classification task. Otherwise, formulate the prediction task as a regression task.
- convert_to_log (bool) - If True, convert the target score to logspace for easier regression’
- threshold (float) - The threshold that select target score ??