Analyze duplicate token entries in the Solana ecosystem and identify the “original” token in each group of duplicates (the token created earlier than all others with the same name and token symbol).
- Only take into account tokens created before March 19, 2025.
- Treat name and symbol as case-insensitive when matching duplicates (“SSS” is a duplicate of “sss” and “SsS”).
- Only output groups of at least 50 tokens with the same name and symbol (the count of 50 includes the original token).
- Of those groups, only output the top 40, ranked by the most recent creation time of the original token.
The result should include the following data:
- Token name and symbol (from the earliest occurrence)
- First creation timestamp (original token creation time)
- Last creation timestamp
- Total number of duplicates (including case variations)
- Token mint address from the earliest occurrence (original token address)
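The rules above can be sketched in Python against an in-memory list of token records. This is a minimal illustration, not the baseline solution. The record keys (`name`, `symbol`, `mint`, `created_at`), the function name `find_originals`, and the output keys are all hypothetical. It also assumes that "before March 19, 2025" means strictly earlier than 2025-03-19 00:00 UTC and that lowercasing is an adequate case-insensitive match. Ties on creation time are not addressed by the task, so the sketch keeps whichever record comes first in the input.

```python
from collections import defaultdict
from datetime import datetime, timezone

# Assumption: the cutoff is midnight UTC at the start of March 19, 2025.
CUTOFF = datetime(2025, 3, 19, tzinfo=timezone.utc)


def find_originals(tokens, min_count=50, top_n=40, cutoff=CUTOFF):
    """Return one row per duplicate group, newest original first.

    `tokens` is an iterable of dicts with the hypothetical keys
    name, symbol, mint, and created_at (timezone-aware datetime).
    """
    groups = defaultdict(list)
    for token in tokens:
        if token["created_at"] >= cutoff:
            continue  # ignore tokens created on or after the cutoff
        # Case-insensitive grouping key: "SSS", "sss", "SsS" all match.
        key = (token["name"].lower(), token["symbol"].lower())
        groups[key].append(token)

    rows = []
    for members in groups.values():
        if len(members) < min_count:
            continue  # the count includes the original token
        original = min(members, key=lambda t: t["created_at"])
        last = max(members, key=lambda t: t["created_at"])
        rows.append({
            "name": original["name"],      # as written on the earliest token
            "symbol": original["symbol"],
            "first_created_at": original["created_at"],
            "last_created_at": last["created_at"],
            "total": len(members),
            "mint": original["mint"],
        })

    # Keep the groups whose original token was created most recently.
    rows.sort(key=lambda r: r["first_created_at"], reverse=True)
    return rows[:top_n]
```

The same grouping translates to a SQL query with `GROUP BY` on the lowercased name and symbol, a `HAVING COUNT(*) >= 50` filter, and an `ORDER BY ... DESC LIMIT 40` on the earliest creation time. The column names to use depend on the table chosen.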
Solana data:
I recommend using the [tokens_solana.fungible](<https://docs.dune.com/data-catalog/curated/solana/asset-tracking/solana-token-metadata>) table on Dune, or a similar source of Solana token data.
Correctness criteria:
The table should be equal to a baseline solution.