Shortest Uncommon Substring in an Array

You are given an array arr of size n consisting of non-empty strings.

Find a string array answer of size n such that:

answer[i] is the shortest substring of arr[i] that does not occur as a substring in any other string in arr.
If multiple such substrings exist, answer[i] should be the lexicographically smallest.
If no such substring exists, answer[i] should be an empty string.

Return the array answer.
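The two tie-breaking rules above amount to a single ordering on candidate substrings: compare lengths first, then fall back to lexicographic order. A minimal C sketch of that ordering follows. The name `better_candidate` is hypothetical and not part of the problem statement, and the sketch assumes both arguments are NUL-terminated strings.

```c
#include <assert.h>
#include <string.h>

/* Returns nonzero if candidate a should be preferred over candidate b:
 * the shorter one wins; on equal length, the lexicographically smaller wins.
 * better_candidate is an illustrative name, not taken from the problem. */
int better_candidate(const char *a, const char *b) {
    assert(a != NULL && b != NULL);
    size_t la = strlen(a), lb = strlen(b);
    if (la != lb) return la < lb;
    return strcmp(a, b) < 0;
}
```

With this ordering, "ab" is preferred over "ca" (same length, smaller lexicographically), and any one-character candidate is preferred over any two-character one.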

Input & Output

Example 1 — Basic Case

$ Input: arr = ["cab","ad","bad","c"]

› Output: ["ab","","ba",""]

💡 Note: For "cab": every single character occurs elsewhere ("c" in "c", "a" in "ad" and "bad", "b" in "bad"). Of the length-2 substrings, "ca" and "ab" both occur in no other string, and "ab" is lexicographically smaller, so the answer is "ab". For "ad": "a", "d", and "ad" all occur in "bad", so the answer is the empty string. For "bad": every single character occurs elsewhere and "ad" occurs in "ad", but "ba" occurs in no other string, so the answer is "ba". For "c": "c" occurs in "cab", so the answer is the empty string.

Example 2 — Single Characters

$ Input: arr = ["abc","bcd","abcd"]

› Output: ["","","abcd"]

💡 Note: Every substring of "abc" ("a", "b", "c", "ab", "bc", "abc") also occurs in "abcd", so the answer for "abc" is the empty string. The same holds for "bcd". For "abcd": each single character occurs in "abc" or "bcd"; "ab" and "bc" occur in "abc" and "cd" occurs in "bcd"; "abc" and "bcd" are themselves elements of arr. Only "abcd" occurs in no other string, so the answer is "abcd".

Example 3 — Edge Case

$ Input: arr = ["a","aa","aaa"]

› Output: ["","","aaa"]

💡 Note: For "a": "a" occurs in "aa" and "aaa", so no uncommon substring exists. For "aa": "a" occurs in every string and "aa" also occurs inside "aaa", so the answer is again the empty string. For "aaa": "a" and "aa" occur in "aa", while "aaa" occurs in no other string, so the answer is "aaa".

Constraints

n == arr.length
2 ≤ n ≤ 100
1 ≤ arr[i].length ≤ 20
arr[i] consists of lowercase English letters

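The key insight is to check substrings in order of increasing length and, at the first length where any substring occurs in no other string, take the lexicographically smallest one. The optimized approach uses a hash map to precompute, for every substring, how many strings in arr contain it, then picks for each string the shortest, lexicographically smallest substring whose count is 1. With n strings of length at most m: Time: O(n × m³), Space: O(n × m²).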

Common Approaches

✓ Brute Force - Check All Substrings

⏱️ Time: O(n² × m³) Space: O(m)

For each string, generate all of its substrings in order of increasing length, starting from length 1. For each substring, check whether it appears in any other string in the array. Return the shortest substring that appears in no other string, breaking ties by lexicographic order.

Hash Map Optimization

⏱️ Time: O(n × m³) Space: O(n × m²)

First pass: generate all substrings of every string and record in a hash map how many distinct strings contain each one (a substring repeated inside a single string counts once). Second pass: for each string, find the shortest, lexicographically smallest substring whose count is 1.
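The two passes can be sketched in C as below. This is one possible illustration under stated assumptions, not the page's reference solution: C has no built-in hash map, so the sketch uses a hypothetical fixed-size open-addressing table with an FNV-1a hash, and the names `Entry`, `find_slot`, and `shortest_uncommon` are invented for the example. It relies on the constraints (at most 100 strings of length at most 20) to size the table.

```c
#include <assert.h>
#include <string.h>

#define MAX_LEN 20        /* longest string allowed by the constraints */
#define MAX_STRINGS 100   /* largest n allowed by the constraints */
#define TABLE_SIZE 65536  /* power of two, well above 100 * 210 substrings */

/* One slot of the open-addressing table (hypothetical layout). */
typedef struct {
    char key[MAX_LEN + 1];
    int owners;      /* number of distinct strings containing key; 0 = empty slot */
    int last_owner;  /* index of the last string counted for key */
} Entry;

static Entry table[TABLE_SIZE];

/* FNV-1a hash of a NUL-terminated string. */
static unsigned hash_str(const char *s) {
    unsigned h = 2166136261u;
    while (*s) {
        h ^= (unsigned char)*s++;
        h *= 16777619u;
    }
    return h;
}

/* Linear probing: returns the slot holding key, or the empty slot where it belongs. */
static Entry *find_slot(const char *key) {
    unsigned i = hash_str(key) & (TABLE_SIZE - 1);
    while (table[i].owners != 0 && strcmp(table[i].key, key) != 0)
        i = (i + 1) & (TABLE_SIZE - 1);
    return &table[i];
}

/* Fills out[i] with the answer for arr[i]; out[i] is "" when none exists. */
void shortest_uncommon(char **arr, int n, char out[][MAX_LEN + 1]) {
    char sub[MAX_LEN + 1];
    assert(n <= MAX_STRINGS);
    memset(table, 0, sizeof table);

    /* Pass 1: for every substring, count how many distinct strings contain it. */
    for (int i = 0; i < n; i++) {
        int len = (int)strlen(arr[i]);
        assert(len <= MAX_LEN);
        for (int start = 0; start < len; start++) {
            for (int l = 1; start + l <= len; l++) {
                memcpy(sub, arr[i] + start, (size_t)l);
                sub[l] = '\0';
                Entry *e = find_slot(sub);
                if (e->owners == 0) {            /* substring seen for the first time */
                    strcpy(e->key, sub);
                    e->owners = 1;
                    e->last_owner = i;
                } else if (e->last_owner != i) { /* seen before, but not yet in arr[i] */
                    e->owners++;
                    e->last_owner = i;
                }
            }
        }
    }

    /* Pass 2: shortest length first; within a length keep the smallest string. */
    for (int i = 0; i < n; i++) {
        int len = (int)strlen(arr[i]);
        out[i][0] = '\0';
        for (int l = 1; l <= len && out[i][0] == '\0'; l++) {
            for (int start = 0; start + l <= len; start++) {
                memcpy(sub, arr[i] + start, (size_t)l);
                sub[l] = '\0';
                if (find_slot(sub)->owners == 1 &&
                    (out[i][0] == '\0' || strcmp(sub, out[i]) < 0))
                    strcpy(out[i], sub);
            }
        }
    }
}
```

The `last_owner` field is what makes the count "per string" rather than "per occurrence": in ["a","aa","aaa"], the substring "aa" occurs twice inside "aaa" but is counted for that string only once, so its count is 2 and it is correctly rejected for "aa".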

Brute Force - Check All Substrings — Algorithm Steps

Step 1: For each string, generate all substrings by length (1, 2, 3...)
Step 2: For each substring, check if it exists in any other string
Step 3: At the first length that yields any unique substring, return the lexicographically smallest unique substring of that length; if no length yields one, return an empty string

Step-by-Step Walkthrough

Generate Length 1: Try all single characters first
Check Uniqueness: Test whether the substring exists in any other string
Return First Unique: Pick the shortest, lexicographically smallest unique substring

Code

solution.c (C)

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

char** solution(char** arr, int n, int* returnSize) {
    char** result = (char**)malloc(n * sizeof(char*));
    *returnSize = n;
    
    for (int i = 0; i < n; i++) {
        char* current = arr[i];
        int currentLen = strlen(current);
        int found = 0;
        result[i] = (char*)malloc(101 * sizeof(char));
        result[i][0] = '\0';
        
        // Try substrings of increasing length
        for (int length = 1; length <= currentLen && !found; length++) {
            char candidates[100][101];
            int candidateCount = 0;
            
            // Generate all substrings of current length
            for (int start = 0; start <= currentLen - length; start++) {
                char substring[101];
                strncpy(substring, current + start, length);
                substring[length] = '\0';
                
                // Check if this substring appears in any other string
                int unique = 1;
                for (int j = 0; j < n; j++) {
                    if (i != j && strstr(arr[j], substring) != NULL) {
                        unique = 0;
                        break;
                    }
                }
                
                if (unique) {
                    strcpy(candidates[candidateCount], substring);
                    candidateCount++;
                }
            }
            
            // If we found unique substrings of this length, take lexicographically smallest
            if (candidateCount > 0) {
                strcpy(result[i], candidates[0]);
                for (int k = 1; k < candidateCount; k++) {
                    if (strcmp(candidates[k], result[i]) < 0) {
                        strcpy(result[i], candidates[k]);
                    }
                }
                found = 1;
            }
        }
    }
    
    return result;
}

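int main(void) {
    char line[10000];
    // Read one line such as ["cab","ad","bad","c"]; stop if there is no input
    if (fgets(line, sizeof(line), stdin) == NULL) return 1;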

    // Parse JSON array manually
    char* arr[100];
    int n = 0;

    char* p = line;

    // Skip until opening [
    while (*p && *p != '[') p++;
    if (*p == '[') p++;

    while (*p && *p != ']' && n < 100) { // arr holds at most 100 strings
        // Skip whitespace and commas
        while (*p == ' ' || *p == ',' || *p == '\n' || *p == '\t') p++;
        if (*p == ']' || *p == '\0') break;

        if (*p == '"') {
            p++; // skip opening quote
            char* start = p;

            // Move until closing quote
            while (*p && *p != '"') p++;

            int len = p - start;
            arr[n] = (char*)malloc(len + 1);
            strncpy(arr[n], start, len);
            arr[n][len] = '\0';
            n++;

            if (*p == '"') p++; // skip closing quote
        } else {
            p++; // skip any unexpected character so the loop always advances
        }
    }

    int returnSize;
    char** result = solution(arr, n, &returnSize);

    printf("[");
    for (int i = 0; i < returnSize; i++) {
        printf("\"%s\"", result[i]);
        if (i < returnSize - 1) printf(",");
    }
    printf("]\n");

    // Cleanup
    for (int i = 0; i < n; i++) {
        free(arr[i]);
    }
    for (int i = 0; i < returnSize; i++) {
        free(result[i]);
    }
    free(result);

    return 0;
}

Time & Space Complexity

Time Complexity: O(n² × m³)

For each of the n strings, generate O(m²) substrings and check each against the other strings, with each of the O(n) checks taking O(m) time.

Space Complexity: O(m)

Only the current substring and the result array are stored.
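As a rough worked example (an upper-bound estimate from the stated bounds, not a measurement), plugging the constraint limits n = 100 and m = 20 into the brute-force bound, and into the O(n × m³) bound quoted for the hash map approach, gives:

```latex
\[
n^2 \cdot m^3 = 100^2 \cdot 20^3 = 10^4 \cdot 8 \times 10^3 = 8 \times 10^7
\]
\[
n \cdot m^3 = 100 \cdot 20^3 = 10^2 \cdot 8 \times 10^3 = 8 \times 10^5
\]
```

These figures ignore constant factors, but they suggest that under these constraints the brute-force approach does about a hundred times more work than the hash map approach in the worst case.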
