In the following steps we will show a short guide to help you getting started with ml-stars.
We will be executing a single primitive for data transformation.
The first step in order to run a primitive is to load it.
This will be done using the mlstars.load_primitive function, which will load the indicated primitive as an MLBlock Object from MLBlocks.
mlstars.load_primitive
In this case, we will load the sklearn.preprocessing.MinMaxScaler primitive.
sklearn.preprocessing.MinMaxScaler
In [1]: from mlstars import load_primitive In [2]: primitive = load_primitive('sklearn.preprocessing.MinMaxScaler')
The StandardScaler is a transformation primitive which scales your data into a given range.
To use this primtives, we generate a synthetic data with some numeric values.
In [3]: import numpy as np In [4]: data = np.array([10, 1, 3, -1, 5, 6, 0, 4, 13, 4]).reshape(-1, 1)
The data is a list of integers where their original range is between [-1, 13].
data
In order to run our primitive, we first need to fit it.
This is the process where it analyzes the data to detect what is the original range of the data.
This is done by calling its fit method and passing the data as X.
In [5]: primitive.fit(X=data)
Once the pipeline is fit, we can process the data by calling the produce method of the primitive instance and passing agin the data as X.
In [6]: transformed = primitive.produce(X=data) In [7]: transformed Out[7]: array([[0.78571429], [0.14285714], [0.28571429], [0. ], [0.42857143], [0.5 ], [0.07142857], [0.35714286], [1. ], [0.35714286]])
We can see how the transformed data contains the transformed values and the data is now in [0, 1] range.
transformed