Explain the following picture which depicts one of the processes on NLP. Also mention the purpose which will be achieved by this process.
Generated by claude-sonnet-4-6 · 2026-06-21 03:18 · grounding rag
Model Answer
The diagram depicts Converting Text to a Common Case, a step in Text Normalisation (Text Processing) in NLP.
In this process, all words — regardless of their capitalisation (e.g., HELLO, HeLlo, HELLo) — are converted to lowercase (hello).
Purpose: This ensures uniformity in word representation, so that the machine does not treat the same word written in different cases as different words, thereby improving accuracy of text analysis.
Source: Chapter 6, Section 6.5 – Text Processing (Text Normalisation)
---
Explanation
- The examiner wants you to name the step (Converting Text to Common Case / Lowercase Conversion) and state its purpose — both are needed for full 2 marks.
- Quote from the textbook: "This ensures that the case sensitivity of the machine does not consider the same words as different just because of different cases."
- Don't confuse this with Tokenisation or Stop Word Removal — those are different steps. This specifically deals with case uniformity.