08 May

LARGE LANGUAGE MODELS (LLMs)

Large Language Models (LLMs) are advanced artificial intelligence systems trained on massive volumes of textual data to understand, process and generate human-like language.They are based primarily on deep learning and transformer architecture, enabling them to perform tasks such as:

Text generation
Translation
Summarisation
Question answering
Coding assistance
Conversational interaction
Reasoning and analysis

MAJOR LARGE LANGUAGE MODELS (LLMs) AND THEIR DEVELOPERS (2025–26)

LLM / Model	Developer / Company	Type	Major Strength
GPT-5	OpenAI	Proprietary	General reasoning, agentic AI, multimodal tasks
Claude 3.7 Sonnet	Anthropic	Proprietary	Coding, long reasoning, safety-focused AI
Gemini 2.5 Pro	Google DeepMind	Proprietary	Multimodal AI, long-context reasoning
Grok-3	xAI	Proprietary	Mathematics, reasoning, web-integrated search
Llama 4	Meta AI	Open-Weight	Efficient Mixture-of-Experts (MoE) architecture
DeepSeek-V3 / R1	DeepSeek	Open-Weight	Cost-efficient advanced reasoning
Qwen2.5-Max / Qwen3	Alibaba Cloud	Open-Weight	Multilingual AI, coding, enterprise applications
Mistral Large 3	Mistral AI	Open-Weight	Enterprise AI, efficient inference
Kimi K2	Moonshot AI	Open-Weight	Large-context processing
Phi-4	Microsoft AI	Small Language Model (SLM)	Lightweight AI for smaller devices
Command R+	Cohere	Enterprise LLM	Retrieval-Augmented Generation (RAG)
Gemini Nano	Google	On-device AI	Mobile and edge-device AI
Copilot Models	Microsoft & OpenAI	Proprietary	Coding and productivity assistance
Falcon	Technology Innovation Institute (TII)	Open-Source	Arabic and multilingual AI
Yi Models	01.AI	Open-Weight	Bilingual Chinese-English tasks
StableLM	Stability AI	Open-Source	Lightweight generative AI
Jurassic-2	AI21 Labs	Proprietary	Enterprise text generation
PaLM (earlier generation)	Google	Proprietary	Foundation model research
BLOOM	BigScience Consortium	Open-Source	Multilingual collaborative AI
OPT	Meta AI	Open-Source	Research-focused transformer model

NON-LLM AI SYSTEMS : UPSC SCIENCE & TECHNOLOGY NOTES

INTRODUCTION

Artificial Intelligence (AI) is much broader than Large Language Models (LLMs) such as GPT, Gemini or Claude. While LLMs focus mainly on understanding and generating human language, many AI systems operate without language modelling and are designed for specialised tasks such as image recognition, prediction, robotics, scientific discovery and decision-making.These systems are collectively referred to as Non-LLM AI Systems or Specialised AI Systems.

WHAT ARE NON-LLM AI SYSTEMS?

Non-LLM AI systems are artificial intelligence models that do not primarily rely on large-scale text generation.They generally focus on:

Visual understanding
Prediction and forecasting
Robotics
Scientific computation
Decision optimisation
Classification and anomaly detection

MAJOR TYPES OF NON-LLM AI SYSTEMS

1. COMPUTER VISION SYSTEMS

Meaning

AI systems designed to analyse and interpret images and videos.

Major Functions

Object detection
Image classification
Facial recognition
Video analytics
Medical imaging

IMPORTANT EXAMPLES

AI System	Developer	Use
Segment Anything Model (SAM)	Meta AI	Image segmentation
YOLO (You Only Look Once)	Open-source community	Real-time object detection
OpenCV	OpenCV Foundation	Computer vision library
Facial Recognition Systems	Various companies	Security and authentication

2. NON-TEXT GENERATIVE AI MODELS

Meaning

AI systems capable of generating images, audio, proteins or structures instead of text.

IMPORTANT EXAMPLES

AI System	Developer	Use
Stable Diffusion	Stability AI	AI image generation
Midjourney	Midjourney	Artistic image generation
AlphaFold	Google DeepMind	Protein structure prediction
Latent Consistency Models (LCMs)	Research community	Fast image generation

ALPHAFOLD : VERY IMPORTANT

AlphaFold revolutionised biology by accurately predicting 3D protein structures.

3. PREDICTIVE AND NUMERICAL AI

Meaning

AI systems used for forecasting future outcomes using numerical and statistical data.

IMPORTANT EXAMPLES

AI System	Application
Recommendation Engines	Netflix, Amazon, YouTube
ARIMA Models	Time-series forecasting
Prophet Models	Financial forecasting
Gradient Boosting Models (GBMs)	Classification and prediction

4. REINFORCEMENT LEARNING (RL) AGENTS

Meaning

AI systems that learn through trial-and-error interactions with environments to maximise rewards.

IMPORTANT EXAMPLES

AI System	Developer	Achievement
AlphaGo	Google DeepMind	Defeated world Go champion
AlphaZero	Google DeepMind	Self-learning game AI
Robotics Control Systems	Various companies	Autonomous robots

5. ENCODER-ONLY LANGUAGE MODELS (NON-GENERATIVE)

Meaning

Models focused on language understanding rather than text generation.

IMPORTANT EXAMPLES

Model	Developer	Use
BERT	Google	Search understanding
RoBERTa	Meta AI	NLP tasks
DeBERTa	Microsoft AI	Semantic analysis

SMALL LANGUAGE MODELS (SLMs)

Meaning

Compact AI models designed to run efficiently on smaller hardware.

IMPORTANT EXAMPLES

Model	Developer
Phi-3 / Phi-4	Microsoft AI
Gemma	Google

IMPORTANT UPSC EXAMPLES

AI System	UPSC Relevance
AlphaFold	Biotechnology
YOLO	Computer vision
AlphaGo	Reinforcement learning
Stable Diffusion	Generative AI
BERT	NLP
Random Forest	Machine learning
OpenCV	Image processing

UPPSCMPPSCBPSCUPSC

Comments

MAJOR LARGE LANGUAGE MODELS (LLMs) AND NON-LLM AI SYSTEMS (2025–26)

LARGE LANGUAGE MODELS (LLMs)

MAJOR LARGE LANGUAGE MODELS (LLMs) AND THEIR DEVELOPERS (2025–26)

NON-LLM AI SYSTEMS : UPSC SCIENCE & TECHNOLOGY NOTES

INTRODUCTION

WHAT ARE NON-LLM AI SYSTEMS?

MAJOR TYPES OF NON-LLM AI SYSTEMS

1. COMPUTER VISION SYSTEMS

Meaning

Major Functions

IMPORTANT EXAMPLES

2. NON-TEXT GENERATIVE AI MODELS

Meaning

IMPORTANT EXAMPLES

ALPHAFOLD : VERY IMPORTANT

3. PREDICTIVE AND NUMERICAL AI

Meaning

IMPORTANT EXAMPLES

4. REINFORCEMENT LEARNING (RL) AGENTS

Meaning

IMPORTANT EXAMPLES

5. ENCODER-ONLY LANGUAGE MODELS (NON-GENERATIVE)

Meaning

IMPORTANT EXAMPLES

SMALL LANGUAGE MODELS (SLMs)

Meaning

IMPORTANT EXAMPLES

IMPORTANT UPSC EXAMPLES