Answers for "sort dataframe by multiple columns"

23

sort dataframe by column

df.sort_values(by='col1', ascending=False)
Posted by: Guest on April-30-2020
25

df sort values

>>> df.sort_values(by=['col1'], ascending = False)
    col1 col2 col3
0   A    2    0
1   A    1    1
2   B    9    9
5   C    4    3
4   D    7    2
3   NaN  8    4
Posted by: Guest on April-07-2020
4

sort by two columns in pandas

df.sort_values(['a', 'b'], ascending=[True, False])
Posted by: Guest on August-05-2020
2

pandas sort columns by name

df = df.reindex(sorted(df.columns), axis=1)
Posted by: Guest on November-26-2020
12

dataframe groupby multiple columns

grouped_multiple = df.groupby(['Team', 'Pos']).agg({'Age': ['mean', 'min', 'max']})
grouped_multiple.columns = ['age_mean', 'age_min', 'age_max']
grouped_multiple = grouped_multiple.reset_index()
print(grouped_multiple)
Posted by: Guest on October-15-2020
0

Returns a new DataFrame sorted by the specified column(s)

# Returns a new DataFrame sorted by the specified column(s)

df.sort(df.age.desc()).collect()
# [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
df.sort("age", ascending=False).collect()
# [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
df.orderBy(df.age.desc()).collect()
# [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
from pyspark.sql.functions import *
df.sort(asc("age")).collect()
# [Row(age=2, name=u'Alice'), Row(age=5, name=u'Bob')]
df.orderBy(desc("age"), "name").collect()
# [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
df.orderBy(["age", "name"], ascending=[0, 1]).collect()
# [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
Posted by: Guest on April-20-2020

Code answers related to "sort dataframe by multiple columns"

Python Answers by Framework

Browse Popular Code Answers by Language