Delete Duplicate Emails

Delete Duplicate Emails - Problem

Database Easy

You are given a table Person that contains email addresses with their corresponding IDs.

Your task: Write a DELETE statement to remove all duplicate emails, keeping only the one with the smallest ID.

Important: You must write a DELETE statement, not a SELECT statement. The solution should modify the table in place.

Each email may appear multiple times with different IDs
Keep the row with the minimum ID for each email
Delete all other duplicate rows

Table Schema

Person

Column Name	Type	Description
`id` PK	int	Primary key, unique identifier
`email`	varchar	Email address (no uppercase letters)

Primary Key: id

Note: Each row represents a person with their email address

Input & Output

Example 1 — Basic Duplicate Removal

Input Table:

id	email
1	john@example.com
2	bob@example.com
3	john@example.com

Output:

id	email
1	john@example.com
2	bob@example.com

💡 Note:

The email john@example.com appears twice with IDs 1 and 3. We keep the row with the smaller ID (1) and delete the duplicate with ID 3.

Example 2 — Multiple Duplicates

Input Table:

id	email
1	alice@example.com
2	bob@example.com
3	alice@example.com
4	charlie@example.com
5	bob@example.com

Output:

id	email
1	alice@example.com
2	bob@example.com
4	charlie@example.com

💡 Note:

Multiple emails have duplicates: alice@example.com (IDs 1,3) and bob@example.com (IDs 2,5). We keep the rows with smaller IDs (1,2) and delete the duplicates (3,5).

Constraints

1 ≤ id ≤ 1000
email contains no uppercase letters
email follows valid email format

Visualization

Tap to expand

Asked in

G Google 28 A Amazon 22 M Microsoft 18

Use DELETE with self-join or subquery with MIN(id) to remove duplicate emails while preserving the row with the smallest ID for each email address.

Table Schema

Person

Column Name	Type	Description
`id` PK	int	Primary key, unique identifier
`email`	varchar	Email address (no uppercase letters)

Primary Key: id

Note: Each row represents a person with their email address

Common Approaches

✓ Self-Join DELETE

⏱️ Time: O(n²) Space: O(1)

Use DELETE with self-join to remove rows where the same email exists with a smaller ID. This efficiently identifies and removes duplicates in a single operation.

Subquery DELETE

⏱️ Time: O(n log n) Space: O(n)

Use DELETE with subquery to keep only rows where the ID is the minimum for that email. The subquery finds the smallest ID for each email group.

Self-Join DELETE — Algorithm Steps

Step 1: Join Person table with itself on matching emails
Step 2: Delete rows where another row has same email but smaller ID

Visualization

Tap to expand

Step-by-Step Walkthrough

Self-Join

Join table with itself on email

Filter

Find rows with same email but larger ID

Delete

Remove duplicate rows

Code -

solution.c — C

Time & Space Complexity

Time Complexity

⏱️

O(n²)

Self-join compares each row with others

⚠ Quadratic Growth

Space Complexity

O(1)

In-place deletion, no extra storage

✓ Linear Space

125.0K Views

High Frequency

~12 min Avg. Time

892 Likes

Ln 1, Col 1

Smart Actions

💡 Explanation

AI Ready

💡 Suggestion Tab to accept Esc to dismiss

// Output will appear here after running code

Code Editor Closed

Click the red button to reopen

Table Schema

Input & Output

Constraints

Visualization

Related Problems

Table Schema

Common Approaches

Self-Join DELETE — Algorithm Steps

Visualization

Code -

Time & Space Complexity

Select Compiler