renewenergy.clean_data

Module Contents

Functions

clean_data(dataread, dataout, datafile1, datafile2, seed)

Perform all cleaning steps on the dataset

renewenergy.clean_data.clean_data(dataread, dataout, datafile1, datafile2, seed)[source]

Perform all cleaning steps on the dataset

Parameters:
  • dataread (str) – Path to dataset

  • dataout (str) – Path to save training and testing datasets to.

  • datafile1 (str) – Name of CSV file to save test data to.

  • datafile2 (str) – Name of CSV file to save training data to.

  • seed (int) – Used to allow for reproduceability of results.

Returns:

  • training.csv – CSV containing the training data

  • test.csv – CSV containing the test data

Examples

>>> plot_rmse("data/energy_train.csv", "data/energy_test.csv", "results/" )