read_from_stream function should use input_df from the parameters #38

KrisSimon · 2024-02-26T13:50:49Z

This function in cmd10:

def read_from_stream(input_df: DataFrame) -> DataFrame:
    ### YOUR CODE HERE
    raw_stream_data = (
        spark.readStream.format("rate")
        .option("rowsPerSecond", 10)
        .load()
    )
    ###


    # This is just data setup, not part of the exercise
    return raw_stream_data.\
        join(mock_data_df, raw_stream_data.value == mock_data_df.index, 'left').\
        drop("timestamp").\
        drop("index")


df = read_from_stream(mock_data_df)

It should use input_df in the join instead of the global mock_data_df; as written, the input_df parameter is never read.
However, if the input parameter is used instead, the tests fail.
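To make the parameter-versus-global problem concrete without needing a Spark session, here is a minimal plain-Python analogy. Everything in it is hypothetical: the dictionaries stand in for the DataFrames, and the names join_with_global, join_with_parameter, mock_data and other_data are made up for illustration and are not part of the notebook.

```python
# Plain-Python stand-in for the notebook's DataFrames (hypothetical data).
mock_data = {0: "alice", 1: "bob"}


def join_with_global(values, input_map):
    # Mirrors the reported behaviour: input_map is accepted but never read,
    # so the lookup always comes from the module-level mock_data.
    return [(v, mock_data.get(v)) for v in values]


def join_with_parameter(values, input_map):
    # Mirrors the proposed change: the lookup comes from the argument.
    # dict.get returns None for a missing key, loosely like a left join.
    return [(v, input_map.get(v)) for v in values]


# Passing a *different* lookup exposes the difference between the two.
other_data = {0: "carol", 1: "dave"}

ignored = join_with_global([0, 1], other_data)
used = join_with_parameter([0, 1], other_data)
```

Calling both variants with an input that differs from the global is one way to detect that the argument is being ignored. In the notebook the call is read_from_stream(mock_data_df), so the parameter and the global are the same object and the difference does not show up there.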
