Remove the duplicate serial#'s row if the Asset Status cell is blank

Hello,

I have a large (4K row) spreadsheet with lots of duplicate “Serial Number”
I would like to delete the duplicate row if the associated “Asset Status” cell is blank

I have tried several verbiage to get this to work, but it does not. Any help will be appreciated!.

Hi @pjamies

Without seeing your actual sheet data it’s hard to know how to help, but here’s a quick example I threw together filtering out duplicate #s

import pandas as pd

# Get the data from Table1
df = q.cells("Table1")

# Drop duplicates, keeping the first occurrence
df_no_duplicates = df.drop_duplicates()

# Return the deduplicated data
df_no_duplicates
1 Like