Write a bash script to calculate the frequency of each word in a text file words.txt.
For simplicity's sake, you may assume:
words.txt contains only lowercase characters and space ' ' characters.

Example:
Assume that words.txt has the following content:
the day is sunny the the the sunny is is
Your script should output the following, sorted by descending frequency:
the 4
is 3
sunny 2
day 1
Note:
The Word Frequency problem asks you to process a text file and print each word along with how many times it appears, sorted by frequency. Since the topic is Shell, the goal is to leverage command-line text processing utilities rather than implementing logic in a traditional programming language.
A common strategy is to first normalize and separate words so each appears on its own line. Tools like tr, awk, or similar utilities can help split text and handle whitespace. After that, you can sort the words so identical words become adjacent. Once grouped, use counting utilities to determine how many times each word occurs.
Finally, sort the results by frequency in descending order to match the expected output format. This pipeline-based approach efficiently processes large text files using Unix utilities designed for streaming data. The dominant cost typically comes from sorting operations, which influences the overall time complexity.
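The pipeline described above can be sketched as follows. This is one common way to combine the utilities, not the only valid solution; it assumes the input matches the problem's guarantees (lowercase words separated by spaces):

```shell
# One word per line, group, count, then sort by count descending.
tr -s ' ' '\n' < words.txt |   # squeeze runs of spaces, split words onto lines
  sort |                       # make identical words adjacent
  uniq -c |                    # prefix each word with its occurrence count
  sort -rn |                   # order by count, highest first
  awk '{print $2, $1}'         # swap columns to "word count" format
```

The `sort | uniq -c` pair does the counting: `uniq -c` only collapses adjacent duplicate lines, which is why the initial `sort` is required.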
| Approach | Time Complexity | Space Complexity |
|---|---|---|
| Shell pipeline using text utilities (split, sort, count) | O(n log n) | O(n) |
Yes, variations of the Word Frequency problem appear in interviews to test text processing, command-line skills, and understanding of data aggregation. It is especially relevant for roles requiring scripting, DevOps, or strong familiarity with Unix tools.
The optimal approach in Shell uses a pipeline of text-processing utilities. Typically, words are separated into individual lines, sorted to group identical words, and then counted before a final sort by frequency. This approach leverages efficient Unix tools for handling large text streams.
Common commands include text-processing tools that split words, sorting utilities to group identical words, and counting utilities to calculate frequencies. These commands are often combined in a pipeline so the output of one command becomes the input of the next.
Conceptually, the problem is similar to using a hash map that counts occurrences of each word. In Shell, sorting and counting utilities simulate this behavior by grouping identical words together and then computing their frequencies.
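The hash-map analogy can also be made literal: awk's associative arrays count every word in a single pass, leaving only the final frequency sort to an external utility. A minimal sketch:

```shell
# Associative array `count` plays the role of the hash map:
# one pass to tally, then sort the "word count" lines numerically by count.
awk '
{ for (i = 1; i <= NF; i++) count[$i]++ }   # increment each word on the line
END { for (w in count) print w, count[w] }  # emit "word count" pairs
' words.txt | sort -rnk2
```

This avoids sorting every word occurrence (only the distinct words are sorted at the end), though for the input sizes typical of this problem either approach is fine.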