Answers for "pandas remove duplicate rows based on one column"

5

python: remove duplicate in a specific column

df = df.drop_duplicates(subset=['Column1', 'Column2'], keep='first')
Posted by: Guest on July-22-2020
3

drop duplicates pandas first column

import pandas as pd 
  
# making data frame from csv file 
data = pd.read_csv("employees.csv") 
  
# sorting by first name 
data.sort_values("First Name", inplace = True) 
  
# dropping ALL duplicte values 
data.drop_duplicates(subset ="First Name",keep = False, inplace = True) 
  
# displaying data 
print(data)
Posted by: Guest on June-28-2020
2

remove duplicates based on two columns in dataframe

df.drop_duplicates(['A','B'],keep= 'last')
Posted by: Guest on August-13-2020
0

drop row with duplicate value

import pandas as pd
df = pd.DataFrame({"A":["foo", "foo", "foo", "bar"], "B":[0,1,1,1], "C":["A","A","B","A"]})
df.drop_duplicates(subset=['A', 'C'], keep=False)
Posted by: Guest on January-04-2020

Code answers related to "pandas remove duplicate rows based on one column"

Python Answers by Framework

Browse Popular Code Answers by Language