Number of Valid Words in a Sentence - Solution & Explanation

Q: Is Number of Valid Words in a Sentence easy or hard?

Number of Valid Words in a Sentence is categorized as an Easy problem on LeetCode. The difficulty comes from carefully implementing the validation rules rather than complex algorithms or data structures.

Q: Number of Valid Words in a Sentence Python/Java solution

Both Python and Java solutions typically split the sentence into tokens and validate each token with conditional checks. Validation runs in O(n) time and needs only a constant amount of state per token, although the split tokens themselves occupy O(n) space. Python and JavaScript also allow an alternative regex-based solution for concise validation.

Q: How to solve Number of Valid Words in a Sentence in O(n)?

Split the sentence into tokens and validate each token in a single pass. Reject tokens containing digits, multiple hyphens, or punctuation in the middle. Ensure a hyphen is surrounded by letters and punctuation appears only at the end. Because each character is checked once, the overall runtime stays O(n).

Q: What is the best approach for Number of Valid Words in a Sentence?

The best approach is iterative token validation. Split the sentence by spaces and scan each token character by character while enforcing the rules for digits, hyphens, and punctuation. This method runs in O(n) time, uses O(1) working state beyond the tokens, and clearly demonstrates the logic behind the constraints.

Q: Is Number of Valid Words in a Sentence asked at Google/Amazon/Meta?

String parsing and token validation problems appear frequently in interviews at companies like Amazon and Google. While this exact problem may not always appear, the pattern of validating characters under strict rules is common in real interview questions.

Q: What data structure is used in Number of Valid Words in a Sentence?

The problem mainly uses basic string traversal and character checks. No complex data structures are required, just iteration through the string and a few flags to track hyphens or punctuation.

Q: What is the time complexity of Number of Valid Words in a Sentence?

The optimal time complexity is O(n), where n is the length of the sentence. Each character is processed at most once while validating tokens. Extra space is O(1) when the sentence is scanned in place, because only a few counters or flags are needed; solutions that split the sentence first use O(n) space for the tokens.

Easy | String | 17 min read | Asked at: Meta, Oracle, Cisco

Problem Statement

A sentence consists of lowercase letters ('a' to 'z'), digits ('0' to '9'), hyphens ('-'), punctuation marks ('!', '.', and ','), and spaces (' ') only. Each sentence can be broken down into one or more tokens separated by one or more spaces ' '.

A token is a valid word if all three of the following are true:

It only contains lowercase letters, hyphens, and/or punctuation (no digits).
There is at most one hyphen '-'. If present, it must be surrounded by lowercase characters ("a-b" is valid, but "-ab" and "ab-" are not valid).
There is at most one punctuation mark. If present, it must be at the end of the token ("ab,", "cd!", and "." are valid, but "a!b" and "c.," are not valid).

Examples of valid words include "a-b.", "afad", "ba-c", "a!", and "!".

Given a string sentence, return the number of valid words in sentence.

Example 1:

Input: sentence = "cat and  dog"
Output: 3
Explanation: The valid words in the sentence are "cat", "and", and "dog".

Example 2:

Input: sentence = "!this  1-s b8d!"
Output: 0
Explanation: There are no valid words in the sentence.
"!this" is invalid because it starts with a punctuation mark.
"1-s" and "b8d" are invalid because they contain digits.

Example 3:

Input: sentence = "alice and  bob are playing stone-game10"
Output: 5
Explanation: The valid words in the sentence are "alice", "and", "bob", "are", and "playing".
"stone-game10" is invalid because it contains digits.

Constraints:

1 <= sentence.length <= 1000
sentence only contains lowercase English letters, digits, ' ', '-', '!', '.', and ','.
There will be at least 1 token.

Approach Overview

Problem Overview: Given a sentence containing lowercase letters, digits, spaces, hyphens, and punctuation (! , .), count how many tokens are valid words. A valid word contains only lowercase letters, may include at most one hyphen surrounded by letters, and may end with a single punctuation mark. Tokens cannot contain digits.

The challenge is mostly careful string validation. You split the sentence by spaces and verify each token against the rules. The tricky parts are handling the hyphen placement and ensuring punctuation only appears at the end.

Approach 1: Simple Iterative Validation (O(n) time, O(1) space)

Split the sentence into tokens on whitespace and validate each token character by character. Track whether a hyphen has already appeared. While iterating, reject the token if you encounter a digit, a second hyphen, or punctuation anywhere but the last position. If a hyphen appears, verify that the characters immediately before and after it are lowercase letters. The validation itself needs only direct character checks and a few boolean flags, so its extra memory is constant; the O(1) bound in the heading assumes the sentence is scanned in place, because materializing the split tokens takes O(n) space.

The key insight: every rule can be validated in a single pass over the token. Since each character is processed exactly once, the total runtime across the sentence is linear. This method relies purely on basic string processing and conditional checks, which makes it fast and easy to implement in languages like C++, Java, or Python.
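As a rough illustration of the single-pass idea, the sketch below scans the sentence in place instead of calling `split()`, which is one way to keep the extra memory constant. The function name `count_valid_words` and the `PUNCTUATION` constant are illustrative choices, not names taken from the original solutions.

```python
PUNCTUATION = "!.,"  # the three punctuation marks allowed by the problem


def count_valid_words(sentence: str) -> int:
    """Count valid tokens with one left-to-right scan and no split()."""
    count = 0
    n = len(sentence)
    i = 0
    while i < n:
        if sentence[i] == " ":
            i += 1
            continue
        # A token starts here; scan to its end while tracking validity.
        start = i
        valid = True
        has_hyphen = False
        while i < n and sentence[i] != " ":
            ch = sentence[i]
            at_end = i + 1 == n or sentence[i + 1] == " "
            if ch.isdigit():
                valid = False
            elif ch in PUNCTUATION:
                # Punctuation is only allowed as the last character.
                if not at_end:
                    valid = False
            elif ch == "-":
                prev_ok = i > start and sentence[i - 1].islower()
                next_ok = not at_end and sentence[i + 1].islower()
                if has_hyphen or not (prev_ok and next_ok):
                    valid = False
                has_hyphen = True
            i += 1
        if valid:
            count += 1
    return count
```

Calling `count_valid_words("cat and  dog")` on Example 1 returns 3. The only state kept per token is two booleans and a start index, so memory does not grow with the input.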

Approach 2: Regular Expression Matching (O(n) time, O(1) space)

A compact alternative is to encode the rules in a regular expression. After splitting the sentence into tokens, match each token against a pattern describing valid words. A typical pattern allows lowercase letters, an optional internal hyphen surrounded by letters, and an optional punctuation mark at the end. If the token matches the pattern, count it as valid.

This approach shifts the validation logic into the regex engine. The pattern effectively describes the grammar of a valid token, reducing manual checks in code. Performance remains linear relative to the sentence length because each token is matched once. It is especially concise in Python or JavaScript where regex support is straightforward.

Recommended for interviews: The iterative validation approach is usually preferred. It demonstrates that you can translate problem constraints into precise character checks and control flow. The regex solution is elegant but hides the reasoning inside a pattern, which some interviewers consider less explicit. Showing the manual validation first proves you understand the rules; mentioning the regex alternative shows broader familiarity with string parsing techniques.

Approach 1: Simple Iterative Approach

This approach involves splitting the sentence into tokens and iterating over each token to validate it based on the given criteria. This is a straightforward method that checks each character of a token to determine its validity.

The C version of this solution uses standard C library functions to split the sentence into tokens on spaces. It then checks each token for validity by ensuring it contains no digits, at most one hyphen surrounded by letters, and at most one punctuation mark, which must be the last character.
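The same checks can be sketched in Python, applying the three rules one after another to each token. This is a minimal sketch rather than the page's original code; `is_valid_word` and `count_valid_words` are hypothetical names, and the rule-by-rule layout makes a small constant number of passes over each token instead of exactly one.

```python
def is_valid_word(token: str) -> bool:
    """Apply the three validity rules from the problem statement in order."""
    # Rule 1: no digits anywhere in the token.
    if any(ch.isdigit() for ch in token):
        return False

    # Rule 2: at most one hyphen, with a lowercase letter on each side.
    if token.count("-") > 1:
        return False
    h = token.find("-")
    if h != -1:
        if h == 0 or h == len(token) - 1:
            return False
        if not (token[h - 1].islower() and token[h + 1].islower()):
            return False

    # Rule 3: punctuation may only be the final character, which also
    # guarantees there is at most one punctuation mark.
    if any(ch in "!.," for ch in token[:-1]):
        return False
    return True


def count_valid_words(sentence: str) -> int:
    # split() with no argument collapses runs of spaces, so no empty tokens.
    return sum(is_valid_word(token) for token in sentence.split())
```

Because `split()` builds a list of tokens, this version uses O(n) space, matching the complexity stated for this approach.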


Complexity

Time Complexity: O(n), where n is the length of the sentence.
Space Complexity: O(n), because tokenization duplicates the string.


Approach 2: Regular Expression Approach

This approach leverages regular expressions to validate words by matching each token against a pre-defined pattern. It simplifies character validation and incorporates the constraints into a single expression.

The Python code uses a regular expression to define a valid word pattern: no digits allowed, an optional hyphen between lowercase letters, and an optional punctuation mark at the end. It splits the sentence into tokens and uses `re.match` to filter valid words.
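One possible encoding of such a pattern is sketched below. The exact regex from the original solution is not shown on this page, so the pattern and the names `VALID_WORD` and `count_valid_words` are assumptions; the sketch also uses `fullmatch` rather than `re.match` so that no explicit `^` and `$` anchors are needed.

```python
import re

# Letters, optionally followed by one hyphen and more letters; the whole
# letter part is optional so that a lone "!" still matches. An optional
# punctuation mark may close the token. Digits can never match.
VALID_WORD = re.compile(r"(?:[a-z]+(?:-[a-z]+)?)?[!.,]?")


def count_valid_words(sentence: str) -> int:
    # split() with no argument never yields empty tokens, so the fact that
    # the pattern would also match an empty string is harmless here.
    return sum(1 for token in sentence.split() if VALID_WORD.fullmatch(token))
```

The nested optional groups mirror the rules directly: a hyphen can only appear between two runs of letters, and punctuation can only appear once, at the very end.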


Complexity

Time Complexity: O(n), where n is the length of the sentence.
Space Complexity: O(n), considering token splitting and regex storage.


Approach 3: Simulation

First, we split the sentence into words by spaces, and then check each word to determine if it is a valid word.

For each word, we can use a boolean variable st to record whether a hyphen has already appeared, and then traverse each character in the word, judging according to the rules described in the problem.

For each character s[i], we have the following cases:

If s[i] is a digit, then s is not a valid word, and we return false directly;
If s[i] is a punctuation mark ('!', '.', ','), and i < len(s) - 1, then s is not a valid word, and we return false directly;
If s[i] is a hyphen, then we need to check if the following conditions are met:
- The hyphen can only appear once;
- The hyphen cannot appear at the beginning or end of the word;
- Both sides of the hyphen must be letters;
If s[i] is a letter, then we do not need to do anything.

Finally, we count the number of valid words in the sentence.

The time complexity is O(n), and the space complexity is O(n). Here, n is the length of the sentence.
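The case analysis above maps almost line for line onto a short function. The following is a sketch under that reading, keeping the `st` flag from the description; the helper name `check` and the wrapper `count_valid_words` are illustrative, not taken from the original code.

```python
def check(s: str) -> bool:
    """Return True if s is a valid word, following the cases listed above."""
    st = False  # records whether a hyphen has already appeared
    for i, c in enumerate(s):
        if c.isdigit():
            return False
        if c in "!.," and i < len(s) - 1:
            # Punctuation anywhere but the last position is invalid.
            return False
        if c == "-":
            # Only one hyphen, and never at the beginning or end.
            if st or i == 0 or i == len(s) - 1:
                return False
            # Both neighbours of the hyphen must be letters.
            if not (s[i - 1].isalpha() and s[i + 1].isalpha()):
                return False
            st = True
        # Letters need no action.
    return True


def count_valid_words(sentence: str) -> int:
    return sum(check(s) for s in sentence.split())
```

Using `isalpha()` for the neighbours is sufficient here because the constraints guarantee that every letter in the input is lowercase.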


Complexity Comparison

Approach	Time	Space
Simple Iterative Approach	O(n)	O(n), because tokenization duplicates the string
Regular Expression Approach	O(n)	O(n), for the split tokens and the regex
Simulation	O(n)	O(n)

Here, n is the length of the sentence.

Detailed Complexity Analysis

Approach	Time	Space	When to Use
Simple Iterative Validation	O(n)	O(1) extra, beyond any split tokens	Best general solution. Clear rule checking and preferred in interviews.
Regular Expression Matching	O(n)	O(1) extra, beyond any split tokens	Useful when regex support is strong and you want concise validation logic.

Video Solution

2047. Number of Valid Words in a Sentence | LEETCODE WEEKLY CONTEST 264 | CODE EXPLAINER • code Explainer • 2,057 views


Related Problems

Maximum Number of Words Found in Sentences

Problem Info

Difficulty: Easy
Acceptance: 31.0%
Approaches: 3
Reading time: 17 min
Asked at: Meta, Oracle, Cisco
