Building a Question Answering System with Python and BERT
In the realm of natural language processing (NLP), question answering systems have gained significant attention and have become an integral part of many applications. These systems are designed to understand human language and provide accurate responses to user queries, mimicking human-like interaction and enhancing user experiences. One such powerful model that has revolutionized the field of NLP is BERT (Bidirectional Encoder Representations from Transformers).
Developed by Google, BERT is a state-of-the-art NLP model known for its remarkable performance on a wide range of NLP tasks, including question answering. Its key innovation lies in capturing the context and meaning of words in a sentence by combining a transformer architecture with bidirectional training.
Understanding BERT Architecture
Traditional approaches fall short in two ways: static word embeddings such as word2vec assign each word a single vector regardless of the sentence it appears in, and conventional left-to-right language models only see the words that come before the target word. Either way, the full context of a word is lost. BERT addresses this limitation by adopting a bidirectional approach, considering both left and right contexts simultaneously.
BERT's architecture, based on the transformer model, utilizes self-attention mechanisms to capture dependencies and relationships between words in a sentence. By attending to all words simultaneously, BERT can generate rich contextualized representations that capture the complex semantic relationships between words.
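The attention idea behind this can be illustrated without any libraries. Below is a minimal sketch of scaled dot-product self-attention over toy word vectors (the `self_attention` helper and the 2-d vectors are illustrative, not BERT's actual weights or dimensions); each output row is a weighted mix of all input rows, which is why every position sees its full left and right context at once:

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(queries, keys, values):
    # Scaled dot-product attention: every query position attends
    # to every key position, left and right alike.
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)  # attention weights sum to 1
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out

# Three toy 2-d "word vectors"; in BERT, queries, keys, and values
# are learned projections of the token embeddings.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
contextualized = self_attention(x, x, x)
print(contextualized)
```

Each of the three output vectors now depends on all three inputs, so identical input words in different sentences come out with different contextualized representations.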
Installation and Setup
To get started, we need to install the required libraries. Use the following command to install transformers and torch:
pip install transformers torch
Building the Question Answering System
Let's implement a complete question answering system using BERT. The system takes a question and a context passage as input and returns the predicted answer:
import torch
from transformers import BertTokenizer, BertForQuestionAnswering

# Load a BERT checkpoint whose question answering head has been
# fine-tuned on SQuAD. The plain 'bert-base-uncased' weights ship
# with a randomly initialized QA head and would return nonsense.
model_name = 'bert-large-uncased-whole-word-masking-finetuned-squad'
tokenizer = BertTokenizer.from_pretrained(model_name)
model = BertForQuestionAnswering.from_pretrained(model_name)
model.eval()

# Function to predict the answer given a question and context
def predict_answer(question, context):
    encoding = tokenizer.encode_plus(
        question,
        context,
        return_tensors='pt',
        max_length=512,
        truncation=True,
        padding=True
    )
    input_ids = encoding['input_ids']
    attention_mask = encoding['attention_mask']

    with torch.no_grad():
        outputs = model(input_ids, attention_mask=attention_mask)

    start_scores = outputs.start_logits
    end_scores = outputs.end_logits

    # Most likely start and end token positions of the answer span
    start_index = torch.argmax(start_scores)
    end_index = torch.argmax(end_scores) + 1

    answer_tokens = tokenizer.convert_ids_to_tokens(input_ids[0][start_index:end_index])
    answer = tokenizer.convert_tokens_to_string(answer_tokens)
    return answer

# Test the question answering system
question = "What is the capital of France?"
context = "France, officially the French Republic, is a country whose capital is Paris. It is located in Western Europe and is known for its rich history and culture."

answer = predict_answer(question, context)
print("Question:", question)
print("Answer:", answer)
Question: What is the capital of France?
Answer: paris
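Beyond the raw answer string, the start and end logits can be turned into a rough confidence score by taking a softmax over each logit vector and multiplying the probabilities of the chosen positions. A minimal sketch, using made-up logit values for a five-token input (the `span_confidence` helper is illustrative; real logits would come from the model above):

```python
import math

def span_confidence(start_logits, end_logits, start_idx, end_idx):
    # Probability of the chosen span = P(start) * P(end),
    # each taken from a softmax over the corresponding logit vector.
    def softmax(xs):
        m = max(xs)
        exps = [math.exp(x - m) for x in xs]
        s = sum(exps)
        return [e / s for e in exps]
    return softmax(start_logits)[start_idx] * softmax(end_logits)[end_idx]

# Hypothetical logits for a 5-token input
start_logits = [0.1, 4.2, 0.3, -1.0, 0.0]
end_logits = [-0.5, 0.2, 5.1, 0.4, 0.1]
start_idx = max(range(len(start_logits)), key=start_logits.__getitem__)
end_idx = max(range(len(end_logits)), key=end_logits.__getitem__)
conf = span_confidence(start_logits, end_logits, start_idx, end_idx)
print(f"Span [{start_idx}, {end_idx}] confidence: {conf:.3f}")
```

A low confidence value is a useful signal that the context may not actually contain the answer.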
Testing with Multiple Questions
Let's test our system with different types of questions to see how it performs:
# Multiple test cases
test_cases = [
    {
        "question": "Who invented the telephone?",
        "context": "Alexander Graham Bell is credited with inventing the telephone in 1876. He was a Scottish-born inventor who worked in the United States."
    },
    {
        "question": "When was Python created?",
        "context": "Python programming language was created by Guido van Rossum and first released in 1991. It has become one of the most popular programming languages."
    },
    {
        "question": "What is machine learning?",
        "context": "Machine learning is a subset of artificial intelligence that enables computers to learn and make decisions from data without being explicitly programmed for every task."
    }
]

for i, test in enumerate(test_cases, 1):
    answer = predict_answer(test["question"], test["context"])
    print(f"Test {i}:")
    print(f"Question: {test['question']}")
    print(f"Answer: {answer}")
    print("-" * 50)
Test 1:
Question: Who invented the telephone?
Answer: alexander graham bell
--------------------------------------------------
Test 2:
Question: When was Python created?
Answer: 1991
--------------------------------------------------
Test 3:
Question: What is machine learning?
Answer: a subset of artificial intelligence that enables computers to learn and make decisions from data without being explicitly programmed for every task
--------------------------------------------------
Key Features of the System
| Feature | Description | Benefit |
|---|---|---|
| Bidirectional Context | Considers both left and right context | Better understanding of word meaning |
| Pre-trained Model | Loads a ready-trained checkpoint via from_pretrained | No training from scratch required |
| Token-based Answers | Extracts answer spans from context | Precise answer extraction |
How the System Works
The question answering system follows these steps:
- Tokenization: The question and context are tokenized and combined
- Encoding: Tokens are converted to numerical representations
- Prediction: BERT predicts start and end positions of the answer
- Decoding: Token positions are converted back to readable text
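Steps 3 and 4 can be sketched in plain Python. Note that taking independent argmaxes of the start and end logits, as the code above does, can occasionally yield an end position before the start; a common refinement is to search jointly for the best valid (start, end) pair. The tokens and logits below are made up for illustration:

```python
def best_span(start_logits, end_logits, max_answer_len=30):
    # Pick the (start, end) pair with the highest combined score,
    # subject to end >= start and a maximum answer length.
    best = (0, 0)
    best_score = float("-inf")
    for s, s_score in enumerate(start_logits):
        for e in range(s, min(s + max_answer_len, len(end_logits))):
            score = s_score + end_logits[e]
            if score > best_score:
                best_score = score
                best = (s, e)
    return best

# Hypothetical tokenized "question [SEP] context" input with logits
tokens = ["[CLS]", "what", "is", "it", "[SEP]", "it", "is", "paris", "[SEP]"]
start_logits = [0, -1, -2, -1, -3, -1, 0, 6, -3]
end_logits   = [0, -2, -1, -2, -3, -2, -1, 7, -3]
s, e = best_span(start_logits, end_logits)
print(" ".join(tokens[s:e + 1]))  # → paris
```

The joint search is only over valid spans, so it can never return an empty or reversed answer the way two independent argmaxes can.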
Limitations and Considerations
While powerful, this system has some limitations:
- Answers must exist within the provided context
- Maximum input length is limited to 512 tokens
- Performance depends on the quality of the context provided
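The 512-token limit is usually worked around by splitting a long context into overlapping windows, running the model on each chunk, and keeping the highest-scoring answer. A minimal sketch of the chunking step (in practice, Hugging Face tokenizers can do this for you via `return_overflowing_tokens=True` and a `stride` argument; the `chunk_with_stride` helper here is illustrative):

```python
def chunk_with_stride(tokens, max_len=512, stride=128):
    # Split a long token list into overlapping windows so that
    # every token appears in at least one chunk together with
    # some surrounding context.
    if len(tokens) <= max_len:
        return [tokens]
    chunks = []
    step = max_len - stride  # how far each window advances
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_len])
        if start + max_len >= len(tokens):
            break
    return chunks

# A made-up 1000-token context
long_context = [f"tok{i}" for i in range(1000)]
chunks = chunk_with_stride(long_context, max_len=512, stride=128)
print(len(chunks), [len(c) for c in chunks])
```

The overlap (stride) matters: without it, an answer span that straddles a chunk boundary would be cut in half and missed by every window.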
Conclusion
Building a question answering system with BERT demonstrates the power of transformer-based models in NLP. The system can accurately extract answers from context using bidirectional understanding. This approach opens possibilities for chatbots, information retrieval systems, and virtual assistants that require intelligent question answering capabilities.
