stepsel.tools.group_over_columns

Functions

group_over_columns(→ pandas.DataFrame)

Group a DataFrame over the given columns and aggregate it according to agg_dict

Module Contents

stepsel.tools.group_over_columns.group_over_columns(dt: pandas.DataFrame, columns: numpy.typing.ArrayLike, agg_dict: dict) → pandas.DataFrame

Group a DataFrame over the given columns and aggregate it according to agg_dict.

Parameters:
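  • dt (pd.DataFrame) – DataFrame to group.

  • columns (ArrayLike) – Columns to group over. Each entry can be a single column name or a list of column names, so both a flat list of columns and a list of lists of columns are accepted.

  • agg_dict (dict) – Dictionary mapping column names to the aggregation function applied to each, for example {"smlr": "sum", "preds": "mean"}.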

Returns:

comparison – DataFrame with the grouped and aggregated data

Return type:

pd.DataFrame
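The shape of agg_dict can be illustrated with plain pandas. The sketch below assumes that agg_dict follows the same column-to-function convention as pandas' own groupby aggregation; it calls pandas directly and does not show how stepsel handles the dictionary internally. The toy data is made up for illustration.

```python
import pandas as pd

# Toy data; column names mirror those used in the examples on this page.
dt = pd.DataFrame({
    "catvar1": ["a", "a", "b"],
    "smlr": [1.0, 2.0, 4.0],
    "preds": [0.25, 0.75, 1.0],
})

# Keys are column names, values are aggregation names understood by pandas.
agg_dict = {"smlr": "sum", "preds": "mean"}

# One row per level of catvar1, one aggregated column per key of agg_dict.
result = dt.groupby("catvar1", as_index=False).agg(agg_dict)
print(result)
```

Columns that do not appear as keys in the dictionary are dropped from the aggregated result, which is standard pandas behaviour.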

Examples

>>> group_over_columns(dt, ["catvar1", "catvar2"], {"smlr": "sum", "preds": "mean", "target": "mean"})
>>> group_over_columns(dt, [["catvar1", "catvar2"]], {"smlr": "sum", "preds": "mean", "target": "mean"})
>>> group_over_columns(dt, [["catvar1", "catvar2"], "catvar3"], {"smlr": "sum", "preds": "mean", "target": "mean"})
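The difference between a flat list and a list of lists can be sketched with a small stand-in function. This is not the stepsel implementation: group_over_columns_sketch and the grouped_by label column are hypothetical. The sketch also assumes that each entry of columns produces its own grouping, and that the groupings are stacked into one DataFrame.

```python
import pandas as pd


def group_over_columns_sketch(dt, columns, agg_dict):
    """Hypothetical stand-in: one groupby per entry of `columns`, then stack."""
    frames = []
    for entry in columns:
        # A bare string is one grouping column; a list groups jointly.
        keys = [entry] if isinstance(entry, str) else list(entry)
        grouped = dt.groupby(keys, as_index=False).agg(agg_dict)
        # Assumed label column recording which grouping produced each row.
        grouped.insert(0, "grouped_by", " x ".join(keys))
        frames.append(grouped)
    return pd.concat(frames, ignore_index=True)


dt = pd.DataFrame({
    "catvar1": ["a", "a", "b", "b"],
    "catvar2": ["x", "y", "x", "y"],
    "smlr": [1.0, 2.0, 3.0, 4.0],
    "preds": [0.25, 0.75, 0.5, 1.0],
    "target": [0, 1, 1, 1],
})
agg_dict = {"smlr": "sum", "preds": "mean", "target": "mean"}

# Flat list: catvar1 and catvar2 are grouped separately.
separate = group_over_columns_sketch(dt, ["catvar1", "catvar2"], agg_dict)

# List of lists: catvar1 and catvar2 are grouped jointly.
joint = group_over_columns_sketch(dt, [["catvar1", "catvar2"]], agg_dict)
```

Under these assumptions, separate holds two rows per grouping column, while joint holds one row per observed combination of catvar1 and catvar2.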