Answers for "one hot encoder"

2

python convert categorical data to one-hot encoding

# Basic syntax:
df_onehot = pd.get_dummies(df, columns=['col_name'], prefix=['one_hot'])
# Where:
#	- get_dummies creates a one-hot encoding for each unique categorical
#		value in the column named col_name
#	- The prefix is added at the beginning of each categorical value 
#		to create new column names for the one-hot columns

# Example usage:
# Build example dataframe:
df = pd.DataFrame(['sunny', 'rainy', 'cloudy'], columns=['weather'])
print(df)
  weather
0   sunny
1   rainy
2  cloudy

# Convert categorical weather variable to one-hot encoding:
df_onehot = pd.get_dummies(df, columns=['weather'], prefix=['one_hot'])
print(df_onehot)
	one_hot_cloudy	 one_hot_rainy   one_hot_sunny
0                0               0               1
1                0               1               0
2                1               0               0
Posted by: Guest on November-12-2020
0

one hot encoder

from sklearn.preprocessing import OneHotEncoder
enc = OneHotEncoder()
# transforming the column after fitting
enc = enc.fit_transform(df[['nom_0']]).toarray()
# converting arrays to a dataframe
encoded_colm = pd.DataFrame(enc)
# concating dataframes 
df = pd.concat([df, encoded_colm], axis = 1) 
# removing the encoded column.
df = df.drop(['nom_0'], axis = 1) 
df.head(10)
Posted by: Guest on October-01-2021
0

one hot encoder

>>> from sklearn.preprocessing import OneHotEncoder
One can discard categories not seen during fit:

>>>
>>> enc = OneHotEncoder(handle_unknown='ignore')
>>> X = [['Male', 1], ['Female', 3], ['Female', 2]]
>>> enc.fit(X)
OneHotEncoder(handle_unknown='ignore')
>>> enc.categories_
[array(['Female', 'Male'], dtype=object), array([1, 2, 3], dtype=object)]
>>> enc.transform([['Female', 1], ['Male', 4]]).toarray()
array([[1., 0., 1., 0., 0.],
       [0., 1., 0., 0., 0.]])
>>> enc.inverse_transform([[0, 1, 1, 0, 0], [0, 0, 0, 1, 0]])
array([['Male', 1],
       [None, 2]], dtype=object)
>>> enc.get_feature_names(['gender', 'group'])
array(['gender_Female', 'gender_Male', 'group_1', 'group_2', 'group_3'],
  dtype=object)
One can always drop the first column for each feature:

>>>
>>> drop_enc = OneHotEncoder(drop='first').fit(X)
>>> drop_enc.categories_
[array(['Female', 'Male'], dtype=object), array([1, 2, 3], dtype=object)]
>>> drop_enc.transform([['Female', 1], ['Male', 2]]).toarray()
array([[0., 0., 0.],
       [1., 1., 0.]])
Or drop a column for feature only having 2 categories:

>>>
>>> drop_binary_enc = OneHotEncoder(drop='if_binary').fit(X)
>>> drop_binary_enc.transform([['Female', 1], ['Male', 2]]).toarray()
array([[0., 1., 0., 0.],
       [1., 0., 1., 0.]])
Posted by: Guest on September-20-2021
0

onehotencoder = OneHotEncoder(categorical_features = [1]) X = onehotencoder.fit_transform(X).toarray() X = X[:, 1:]

from sklearn.compose import ColumnTransformer

ct = ColumnTransformer([('encoder', OneHotEncoder(), [1])], remainder='passthrough')
X = np.array(ct.fit_transform(X), dtype=np.float)
Posted by: Guest on April-23-2020

Browse Popular Code Answers by Language