Tag Validator - Problem

String Stack Hard

Given a string representing a code snippet, implement a tag validator to parse the code and return whether it is valid.

A code snippet is valid if all the following rules hold:

The code must be wrapped in a valid closed tag. Otherwise, the code is invalid.
A closed tag (not necessarily valid) has exactly the following format: <TAG_NAME>TAG_CONTENT</TAG_NAME>. Among them, <TAG_NAME> is the start tag, and </TAG_NAME> is the end tag. The TAG_NAME in start and end tags should be the same.
A closed tag is valid if and only if the TAG_NAME and TAG_CONTENT are valid.
A valid TAG_NAME only contains upper-case letters, and has length in range [1,9]. Otherwise, the TAG_NAME is invalid.
A valid TAG_CONTENT may contain other valid closed tags, cdata and any characters EXCEPT unmatched <, unmatched start and end tag, and unmatched or closed tags with invalid TAG_NAME.
A start tag is unmatched if no end tag exists with the same TAG_NAME, and vice versa. However, you also need to consider the issue of unbalanced when tags are nested.
A < is unmatched if you cannot find a subsequent >. And when you find a < or </, all the subsequent characters until the next > should be parsed as TAG_NAME.
The cdata has the following format: <![CDATA[CDATA_CONTENT]]>. The range of CDATA_CONTENT is defined as the characters between <![CDATA[ and the first subsequent ]]>. CDATA_CONTENT may contain any characters. The function of cdata is to forbid the validator to parse CDATA_CONTENT.

Input & Output

Example 1 — Valid Simple Tag

$ Input: code = "<DIV>This is the first line <![CDATA[<div>]]></DIV>"

› Output: true

💡 Note: The code is wrapped in a valid closed tag

. The tag name DIV is valid (uppercase, length 3). The content contains CDATA which is properly formatted.

Example 2 — Invalid Tag Name

$ Input: code = "<DIV>>> ![cdata[]] <![CDATA[<div>content</div>]]>></DIV>"

› Output: false

💡 Note: The CDATA section is malformed - it should be but has extra characters.

Example 3 — No Wrapping Tag

$ Input: code = "<A> <B> </A> </B>"

› Output: false

💡 Note: The tags are not properly nested. Tag B is opened inside A but closed after A is closed, violating proper nesting rules.

Constraints

1 ≤ code.length ≤ 500
code consists of English letters, digits, '<', '>', '/', '!', '[', ']', '.', and ' '.

Visualization

Tap to expand

Asked in

G Google 15 a Amazon 12

The key insight is to use a stack to track nested tags while handling special CDATA sections that should be skipped entirely. Best approach is stack-based parsing with single pass validation. Time: O(n), Space: O(n)

Common Approaches

✓ Character-by-Character Validation

⏱️ Time: O(n²) Space: O(1)

Iterate through each character, manually checking for tag patterns, CDATA sections, and maintaining state without using a stack data structure.

Stack-Based Tag Matching

⏱️ Time: O(n) Space: O(n)

Parse the code using a stack to track opening tags and match them with corresponding closing tags. Handle CDATA sections by skipping their content entirely.

Character-by-Character Validation — Algorithm Steps

Step 1: Iterate through each character
Step 2: Manually track tag opening/closing state
Step 3: Validate tag names and nesting

Visualization

Tap to expand

Step-by-Step Walkthrough

Parse First Tag

Find and validate the opening tag name

Track Nesting

Manually track nested tags and CDATA sections

Validate Structure

Ensure proper tag matching and closure

Code -

solution.c — C

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <ctype.h>
#include <stdbool.h>

bool isValidTagName(const char* name, int len) {
    if (len < 1 || len > 9) return false;
    for (int i = 0; i < len; i++) {
        if (!isupper(name[i]) || !isalpha(name[i])) return false;
    }
    return true;
}

int findMatchingEndTag(const char* s, int start, const char* tagName, int tagLen) {
    int i = start;
    int sLen = strlen(s);
    
    while (i < sLen) {
        if (i + 9 <= sLen && strncmp(s + i, "<![CDATA[", 9) == 0) {
            char* cdataEnd = strstr(s + i + 9, "]]>");
            if (!cdataEnd) return -1;
            i = cdataEnd - s + 3;
        } else if (s[i] == '<') {
            if (i + 1 < sLen && s[i + 1] == '/') {
                char* closePos = strchr(s + i + 2, '>');
                if (!closePos) return -1;
                int endTagLen = closePos - (s + i + 2);
                if (endTagLen == tagLen && strncmp(s + i + 2, tagName, tagLen) == 0) {
                    return closePos - s + 1;
                }
                i = closePos - s + 1;
            } else {
                char* closePos = strchr(s + i + 1, '>');
                if (!closePos) return -1;
                int nestedTagLen = closePos - (s + i + 1);
                if (!isValidTagName(s + i + 1, nestedTagLen)) return -1;
                
                char nestedTag[10];
                strncpy(nestedTag, s + i + 1, nestedTagLen);
                nestedTag[nestedTagLen] = '\0';
                
                int nestedEnd = findMatchingEndTag(s, closePos - s + 1, nestedTag, nestedTagLen);
                if (nestedEnd == -1) return -1;
                i = nestedEnd;
            }
        } else {
            i++;
        }
    }
    return -1;
}

bool solution(char* code) {
    if (!code || strlen(code) == 0 || code[0] != '<') {
        return false;
    }
    
    char* firstClose = strchr(code, '>');
    if (!firstClose) {
        return false;
    }
    
    int firstTagLen = firstClose - code - 1;
    if (!isValidTagName(code + 1, firstTagLen)) {
        return false;
    }
    
    char firstTag[10];
    strncpy(firstTag, code + 1, firstTagLen);
    firstTag[firstTagLen] = '\0';
    
    int endPos = findMatchingEndTag(code, firstClose - code + 1, firstTag, firstTagLen);
    if (endPos == -1 || endPos != strlen(code)) {
        return false;
    }
    
    return true;
}

int main() {
    char code[10000];
    fgets(code, sizeof(code), stdin);
    code[strcspn(code, "\n")] = 0;
    
    bool result = solution(code);
    printf(result ? "true\n" : "false\n");
    return 0;
}

Time & Space Complexity

Time Complexity

⏱️

O(n²)

Each character may trigger validation of previous characters for matching tags

✓ Linear Growth

Space Complexity

O(1)

Only using variables to track current state

⚡ Linearithmic Space

28.0K Views

Medium Frequency

~35 min Avg. Time

485 Likes

Ln 1, Col 1

Smart Actions

💡 Explanation

AI Ready

💡 Suggestion Tab to accept Esc to dismiss

// Output will appear here after running code

Code Editor Closed

Click the red button to reopen

Tag Validator - Problem

Input & Output

Constraints

Visualization

Related Problems

Common Approaches

Character-by-Character Validation — Algorithm Steps

Visualization

Code -

Time & Space Complexity

Select Compiler