Danila_lipatov

drop_dublicates

Jan 14th, 2024
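import pandas as pd


def drop_dublicate(data):
    # Flag rows that repeat an earlier row on all of the key columns.
    key_columns = ['ogrn', 'press_release_link', 'company_link', '_date', 'rating', 'fin_instrument']
    is_duplicate = data.duplicated(subset=key_columns).rename('is_duplicate')
    # Show the flag next to the original columns for inspection.
    temp_df = pd.concat([data, is_duplicate], axis=1)
    print(temp_df)
    return temp_df


if __name__ == '__main__':
    # data = pd.read_excel('.xlsx')
    # check_regions(data)   # TODO: done; new version is in last_output.xlsx

    data = pd.read_excel('')  # the file path is left empty in the paste
    drop_dublicate(data)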
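
The function above only flags duplicate rows and prints the result; it does not remove anything. A minimal sketch of actually dropping them could look like the following. It assumes the same key columns as the paste; the helper name `drop_duplicate_rows` and the sample data are hypothetical and exist only for illustration.

```python
import pandas as pd

# Key columns copied from the paste above.
KEY_COLUMNS = ['ogrn', 'press_release_link', 'company_link',
               '_date', 'rating', 'fin_instrument']


def drop_duplicate_rows(data, subset=KEY_COLUMNS):
    # Hypothetical helper (not part of the original paste): keep the first
    # occurrence of each key combination and renumber the index.
    return data.drop_duplicates(subset=subset, keep='first').reset_index(drop=True)


# Made-up sample data: the first two rows share every key column.
sample = pd.DataFrame({
    'ogrn': [1, 1, 2],
    'press_release_link': ['a', 'a', 'b'],
    'company_link': ['x', 'x', 'y'],
    '_date': ['2024-01-01', '2024-01-01', '2024-01-02'],
    'rating': ['AA', 'AA', 'B'],
    'fin_instrument': ['bond', 'bond', 'bond'],
})

deduplicated = drop_duplicate_rows(sample)
print(deduplicated)
```

Passing `keep='first'` matches the default behavior of `duplicated`, so the rows removed here should be the same rows that the flag column marks as `True`.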