Python program to count distinct words and count frequency of them

Suppose we have a list of words. These words may occur multiple times. We have to show the frequencies of these words and count how many distinct words are there.

So, if the input is like words = ["Book", "Sound", "Language", "Computer", "Book", "Language"], then the output will be (4, '2 1 2 1') because there are four distinct words, the first and third words have occurred twice.

To solve this, we will follow these steps −

  • d:= an OrderedDict to store items based on insert order
  • for each w in words, do
    • if w is in d, then
      • d[w] := d[w] + 1
    • otherwise,
      • d[w] := 1
  • a pair of the size of list of all keys in d and join all values from d into a string then return.


Let us see the following implementation to get better understanding

from collections import OrderedDict

def solve(words):
   for w in words:
      if w in d:
         d[w] += 1
         d[w] = 1
   return len(d.keys()), ' '.join([str(d[k]) for k in d.keys()])

words = ["Book", "Sound", "Language", "Computer", "Book", "Language"]


["Book", "Sound", "Language", "Computer", "Book", "Language"]


(4, '2 1 2 1')