Question 1

What is the difference between a greedy and a lazy quantifier?

Accepted Answer

A greedy quantifier like * or + tries to match as many characters as possible, then backtracks if needed. A lazy quantifier like *? or +? matches as few characters as possible and expands only if the rest of the pattern fails to match. For example, (.*)<\/b> against 'one<\/b> and two<\/b>' matches everything from the first tag to the last closing tag (greedy), while (.*?)<\/b> correctly finds two separate matches (lazy).

Question 2

How do lookaheads and lookbehinds work, and do they consume characters?

Accepted Answer

Lookaheads and lookbehinds are zero-width assertions — they check for a condition at the current position without advancing the match cursor or consuming any characters. A positive lookahead (?=abc) means 'the next characters must be abc, but do not include them in the match'. A negative lookahead (?!abc) means 'the next characters must NOT be abc'. Lookbehinds do the same thing but look at what came before the current position. They are commonly stacked at the start of a pattern to enforce multiple conditions on the same string simultaneously.

Question 3

Why does my email regex reject valid addresses or accept invalid ones?

Accepted Answer

Email validation with regex is inherently approximate. The full RFC 5321 specification allows quoted strings, comments, IP address literals in brackets, and other forms that almost never appear in practice. Most production systems use a simple regex that accepts any reasonable-looking address (local part + @ + domain + TLD) and then verify the address by sending a confirmation email. A regex that is too strict rejects real users; one that is too loose lets garbage through. The pattern included in this builder covers nearly all real-world addresses while ignoring exotic edge cases.

Question 4

What regex flags should I use and when?

Accepted Answer

The most commonly needed flags are: g (global) to find all matches instead of stopping after the first; i (case-insensitive) so you do not need to write [a-zA-Z] everywhere; m (multiline) to make ^ and $ match the start and end of each line rather than the whole string; and s (dotall) to make the dot . match newline characters as well. In JavaScript, flags are passed as the second argument to new RegExp() or as the suffix in a literal like /pattern/gi. Combining g and i is very common for search-and-replace operations.

Question 5

How do named capturing groups work and how do I access them?

Accepted Answer

Named capturing groups use the syntax (?<name>...) and let you reference a captured value by a readable label instead of a numeric index. In JavaScript, after calling string.match() or regexp.exec(), the captured values are available on the .groups property of the result object — for example result.groups.year for a group named year. Named groups make complex patterns far easier to maintain because renaming or reordering groups does not break references elsewhere in your code.

Question 6

Can regex validate nested structures like HTML or JSON?

Accepted Answer

Not reliably. Regular expressions describe regular languages, which by definition cannot handle arbitrarily nested or recursive structures. HTML and JSON both have nesting that can go to any depth, which requires a pushdown automaton (i.e., a proper parser) to handle correctly. Regex can extract simple, non-nested fragments from HTML — like the content of a single tag — but it will fail on nested tags of the same type. For serious HTML or JSON processing, use a dedicated parser such as the browser's DOMParser for HTML or JSON.parse() for JSON. Use regex only for pre-validation or extracting known-flat patterns.

🧩 Regex Cheatsheet & Builder

🧩 Regex Cheatsheet & Builder

12 Regex Patterns Every Developer Should Have Memorised (and How to Build Your Own)

1. The Email Pattern That Actually Works (Mostly)

2. IPv4 Address With Range Validation

3. ISO Date (YYYY-MM-DD) With Month Validation

4. URL Matching — Greedy Enough to Be Useful

5. US Phone Number — Multiple Formats at Once

6. Hex Color Codes — Both 3 and 6 Digit Forms

7. JWT Token Structure

8. Lookaheads — The Regex Superpower Most Developers Skip

9. Named Capturing Groups — Readable Extraction

10. Non-Greedy Quantifiers Save You From Matching Too Much

11. The Slug Pattern — URL-Safe Strings

12. Backreferences for Duplicate Detection

Building Your Own Patterns: A Mental Model

FAQ