mlm_insights.mlm_spark.execution_engine package

Submodules

mlm_insights.mlm_spark.execution_engine.spark_engine module

class mlm_insights.mlm_spark.execution_engine.spark_engine.SparkEngine

Bases: ExecutionEngine

Class for Spark execution engine functionality.

parse_data_frame_result(profile_dataframe: DataFrame) Profile

Method to parse the profile dataframe based on different Execution engines As Spark converts to a bytearray whereas Dask provides a byte string. We need to handle this distinction between different Execution engines

Parameters

profile_dataframe: DataFrame

Pandas profile dataframe

Returns

Profile

Contains data summary and includes profile header, information about the features, metrics, SFCs.