Research

Paper

AI LLM March 19, 2026

Comparative Analysis of Large Language Models in Generating Telugu Responses for Maternal Health Queries

Authors

Anagani Bhanusree, Sai Divya Vissamsetty, K VenkataKrishna Rao, Rimjhim

Abstract

Large Language Models (LLMs) have been progressively exhibiting there capabilities in various areas of research. The performance of the LLMs in acute maternal healthcare area, predominantly in low resource languages like Telugu, Hindi, Tamil, Urdu etc are still unstudied. This study presents how ChatGPT-4o, GeminiAI, and Perplexity AI respond to pregnancy related questions asked in different languages. A bilingual dataset is used to obtain results by applying the semantic similarity metrics (BERT Score) and expert assessments from expertise gynecologists. Multiple parameters like accuracy, fluency, relevance, coherence and completeness are taken into consideration by the gynecologists to rate the responses generated by the LLMs. Gemini excels in other LLMs in terms of producing accurate and coherent pregnancy relevant responses in Telugu, while Perplexity demonstrated well when the prompts were in Telugu. ChatGPT's performance can be improved. The results states that both selecting an LLM and prompting language plays a crucial role in retrieving the information. Altogether, we emphasize for the improvement of LLMs assistance in regional languages for healthcare purposes.

Metadata

arXiv ID: 2603.18898
Provider: ARXIV
Primary Category: cs.IR
Published: 2026-03-19
Fetched: 2026-03-20 06:02

Related papers

Raw Data (Debug)
{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2603.18898v1</id>\n    <title>Comparative Analysis of Large Language Models in Generating Telugu Responses for Maternal Health Queries</title>\n    <updated>2026-03-19T13:39:38Z</updated>\n    <link href='https://arxiv.org/abs/2603.18898v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2603.18898v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>Large Language Models (LLMs) have been progressively exhibiting there capabilities in various areas of research. The performance of the LLMs in acute maternal healthcare area, predominantly in low resource languages like Telugu, Hindi, Tamil, Urdu etc are still unstudied. This study presents how ChatGPT-4o, GeminiAI, and Perplexity AI respond to pregnancy related questions asked in different languages. A bilingual dataset is used to obtain results by applying the semantic similarity metrics (BERT Score) and expert assessments from expertise gynecologists. Multiple parameters like accuracy, fluency, relevance, coherence and completeness are taken into consideration by the gynecologists to rate the responses generated by the LLMs. Gemini excels in other LLMs in terms of producing accurate and coherent pregnancy relevant responses in Telugu, while Perplexity demonstrated well when the prompts were in Telugu. ChatGPT's performance can be improved. The results states that both selecting an LLM and prompting language plays a crucial role in retrieving the information. Altogether, we emphasize for the improvement of LLMs assistance in regional languages for healthcare purposes.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.IR'/>\n    <published>2026-03-19T13:39:38Z</published>\n    <arxiv:primary_category term='cs.IR'/>\n    <author>\n      <name>Anagani Bhanusree</name>\n    </author>\n    <author>\n      <name>Sai Divya Vissamsetty</name>\n    </author>\n    <author>\n      <name>K VenkataKrishna Rao</name>\n    </author>\n    <author>\n      <name> Rimjhim</name>\n    </author>\n  </entry>"
}