Answers for "pyspark groupby multiple columns"

12

dataframe groupby multiple columns

grouped_multiple = df.groupby(['Team', 'Pos']).agg({'Age': ['mean', 'min', 'max']})
grouped_multiple.columns = ['age_mean', 'age_min', 'age_max']
grouped_multiple = grouped_multiple.reset_index()
print(grouped_multiple)
Posted by: Guest on October-15-2020
0

pyspark group by and average in dataframes

df.groupBy("Profession").agg({'Age':'avg', 'Gender':'count'}).show()
Posted by: Guest on December-01-2020

Code answers related to "pyspark groupby multiple columns"

Python Answers by Framework

Browse Popular Code Answers by Language