Admin Team
08 May

LARGE LANGUAGE MODELS (LLMs)

Large Language Models (LLMs) are advanced artificial intelligence systems trained on massive volumes of textual data to understand, process and generate human-like language.They are based primarily on deep learning and transformer architecture, enabling them to perform tasks such as:

  • Text generation
  • Translation
  • Summarisation
  • Question answering
  • Coding assistance
  • Conversational interaction
  • Reasoning and analysis


MAJOR LARGE LANGUAGE MODELS (LLMs) AND THEIR DEVELOPERS (2025–26)

LLM / ModelDeveloper / CompanyTypeMajor Strength
GPT-5OpenAIProprietaryGeneral reasoning, agentic AI, multimodal tasks
Claude 3.7 SonnetAnthropicProprietaryCoding, long reasoning, safety-focused AI
Gemini 2.5 ProGoogle DeepMindProprietaryMultimodal AI, long-context reasoning
Grok-3xAIProprietaryMathematics, reasoning, web-integrated search
Llama 4Meta AIOpen-WeightEfficient Mixture-of-Experts (MoE) architecture
DeepSeek-V3 / R1DeepSeekOpen-WeightCost-efficient advanced reasoning
Qwen2.5-Max / Qwen3Alibaba CloudOpen-WeightMultilingual AI, coding, enterprise applications
Mistral Large 3Mistral AIOpen-WeightEnterprise AI, efficient inference
Kimi K2Moonshot AIOpen-WeightLarge-context processing
Phi-4Microsoft AISmall Language Model (SLM)Lightweight AI for smaller devices
Command R+CohereEnterprise LLMRetrieval-Augmented Generation (RAG)
Gemini NanoGoogleOn-device AIMobile and edge-device AI
Copilot ModelsMicrosoft & OpenAIProprietaryCoding and productivity assistance
FalconTechnology Innovation Institute (TII)Open-SourceArabic and multilingual AI
Yi Models01.AIOpen-WeightBilingual Chinese-English tasks
StableLMStability AIOpen-SourceLightweight generative AI
Jurassic-2AI21 LabsProprietaryEnterprise text generation
PaLM (earlier generation)GoogleProprietaryFoundation model research
BLOOMBigScience ConsortiumOpen-SourceMultilingual collaborative AI
OPTMeta AIOpen-SourceResearch-focused transformer model

NON-LLM AI SYSTEMS : UPSC SCIENCE & TECHNOLOGY NOTES

INTRODUCTION

Artificial Intelligence (AI) is much broader than Large Language Models (LLMs) such as GPT, Gemini or Claude. While LLMs focus mainly on understanding and generating human language, many AI systems operate without language modelling and are designed for specialised tasks such as image recognition, prediction, robotics, scientific discovery and decision-making.These systems are collectively referred to as Non-LLM AI Systems or Specialised AI Systems.


WHAT ARE NON-LLM AI SYSTEMS?

Non-LLM AI systems are artificial intelligence models that do not primarily rely on large-scale text generation.They generally focus on:

  • Visual understanding
  • Prediction and forecasting
  • Robotics
  • Scientific computation
  • Decision optimisation
  • Classification and anomaly detection

MAJOR TYPES OF NON-LLM AI SYSTEMS

1. COMPUTER VISION SYSTEMS

Meaning

AI systems designed to analyse and interpret images and videos.

Major Functions

  • Object detection
  • Image classification
  • Facial recognition
  • Video analytics
  • Medical imaging

IMPORTANT EXAMPLES

AI SystemDeveloperUse
Segment Anything Model (SAM)Meta AIImage segmentation
YOLO (You Only Look Once)Open-source communityReal-time object detection
OpenCVOpenCV FoundationComputer vision library
Facial Recognition SystemsVarious companiesSecurity and authentication

2. NON-TEXT GENERATIVE AI MODELS

Meaning

AI systems capable of generating images, audio, proteins or structures instead of text.


IMPORTANT EXAMPLES

AI SystemDeveloperUse
Stable DiffusionStability AIAI image generation
MidjourneyMidjourneyArtistic image generation
AlphaFoldGoogle DeepMindProtein structure prediction
Latent Consistency Models (LCMs)Research communityFast image generation

ALPHAFOLD : VERY IMPORTANT

AlphaFold revolutionised biology by accurately predicting 3D protein structures.

3. PREDICTIVE AND NUMERICAL AI

Meaning

AI systems used for forecasting future outcomes using numerical and statistical data.


IMPORTANT EXAMPLES

AI SystemApplication
Recommendation EnginesNetflix, Amazon, YouTube
ARIMA ModelsTime-series forecasting
Prophet ModelsFinancial forecasting
Gradient Boosting Models (GBMs)Classification and prediction

4. REINFORCEMENT LEARNING (RL) AGENTS

Meaning

AI systems that learn through trial-and-error interactions with environments to maximise rewards.


IMPORTANT EXAMPLES

AI SystemDeveloperAchievement
AlphaGoGoogle DeepMindDefeated world Go champion
AlphaZeroGoogle DeepMindSelf-learning game AI
Robotics Control SystemsVarious companiesAutonomous robots

5. ENCODER-ONLY LANGUAGE MODELS (NON-GENERATIVE)

Meaning

Models focused on language understanding rather than text generation.


IMPORTANT EXAMPLES

ModelDeveloperUse
BERTGoogleSearch understanding
RoBERTaMeta AINLP tasks
DeBERTaMicrosoft AISemantic analysis

SMALL LANGUAGE MODELS (SLMs)

Meaning

Compact AI models designed to run efficiently on smaller hardware.


IMPORTANT EXAMPLES

ModelDeveloper
Phi-3 / Phi-4Microsoft AI
GemmaGoogle

IMPORTANT UPSC EXAMPLES

AI SystemUPSC Relevance
AlphaFoldBiotechnology
YOLOComputer vision
AlphaGoReinforcement learning
Stable DiffusionGenerative AI
BERTNLP
Random ForestMachine learning
OpenCVImage processing
Comments
* The email will not be published on the website.