Consider the following documents : Implement all the four steps of Bag of Words (BoW) model to create a docum…

Question

Consider the following documents : Implement all the four steps of Bag of Words (BoW) model to create a document vector table.

Accepted Answer

## Model Answer **Step 1 – Collect Data (Documents):** - Doc 1: "NLP is a domain of AI." - Doc 2: "NLP stands for Natural Language Processing." **Step 2 – Tokenisation (create unique word list/vocabulary):** NLP, is, a, domain, of, AI, stands, for, Natural, Language, Processing *(Total 11 unique words)* **Step 3 – Create Document Vectors (word frequency count):** | Word | Doc 1 | Doc 2 | |------|-------|-------| | NLP | 1 | 1 | | is | 1 | 0 | | a | 1 | 0 | | domain | 1 | 0 | | of | 1 | 0 | | AI | 1 | 0 | | stands | 0 | 1 | | for | 0 | 1 | | Natural | 0 | 1 | | Language | 0 | 1 | | Processing | 0 | 1 | **Step 4 – Apply BoW Model:** Each document is represented as a vector of word frequencies: - Doc 1 → [1,1,1,1,1,1,0,0,0,0,0] - Doc 2 → [1,0,0,0,0,0,1,1,1,1,1] *Source: AI Chapter, Natural Language Processing – Bag of Words* --- ## Explanation Examiners award 1 mark per step. Ensure all four steps are clearly labelled. The vocabulary must list **unique** words only. The document vector t…

Artificial Intelligence — CBSE Class 10 board question