Large Language Models (LLMs) are advanced artificial intelligence systems trained on massive volumes of textual data to understand, process and generate human-like language.They are based primarily on deep learning and transformer architecture, enabling them to perform tasks such as:
| LLM / Model | Developer / Company | Type | Major Strength |
|---|---|---|---|
| GPT-5 | OpenAI | Proprietary | General reasoning, agentic AI, multimodal tasks |
| Claude 3.7 Sonnet | Anthropic | Proprietary | Coding, long reasoning, safety-focused AI |
| Gemini 2.5 Pro | Google DeepMind | Proprietary | Multimodal AI, long-context reasoning |
| Grok-3 | xAI | Proprietary | Mathematics, reasoning, web-integrated search |
| Llama 4 | Meta AI | Open-Weight | Efficient Mixture-of-Experts (MoE) architecture |
| DeepSeek-V3 / R1 | DeepSeek | Open-Weight | Cost-efficient advanced reasoning |
| Qwen2.5-Max / Qwen3 | Alibaba Cloud | Open-Weight | Multilingual AI, coding, enterprise applications |
| Mistral Large 3 | Mistral AI | Open-Weight | Enterprise AI, efficient inference |
| Kimi K2 | Moonshot AI | Open-Weight | Large-context processing |
| Phi-4 | Microsoft AI | Small Language Model (SLM) | Lightweight AI for smaller devices |
| Command R+ | Cohere | Enterprise LLM | Retrieval-Augmented Generation (RAG) |
| Gemini Nano | On-device AI | Mobile and edge-device AI | |
| Copilot Models | Microsoft & OpenAI | Proprietary | Coding and productivity assistance |
| Falcon | Technology Innovation Institute (TII) | Open-Source | Arabic and multilingual AI |
| Yi Models | 01.AI | Open-Weight | Bilingual Chinese-English tasks |
| StableLM | Stability AI | Open-Source | Lightweight generative AI |
| Jurassic-2 | AI21 Labs | Proprietary | Enterprise text generation |
| PaLM (earlier generation) | Proprietary | Foundation model research | |
| BLOOM | BigScience Consortium | Open-Source | Multilingual collaborative AI |
| OPT | Meta AI | Open-Source | Research-focused transformer model |
Artificial Intelligence (AI) is much broader than Large Language Models (LLMs) such as GPT, Gemini or Claude. While LLMs focus mainly on understanding and generating human language, many AI systems operate without language modelling and are designed for specialised tasks such as image recognition, prediction, robotics, scientific discovery and decision-making.These systems are collectively referred to as Non-LLM AI Systems or Specialised AI Systems.
Non-LLM AI systems are artificial intelligence models that do not primarily rely on large-scale text generation.They generally focus on:
AI systems designed to analyse and interpret images and videos.
| AI System | Developer | Use |
|---|---|---|
| Segment Anything Model (SAM) | Meta AI | Image segmentation |
| YOLO (You Only Look Once) | Open-source community | Real-time object detection |
| OpenCV | OpenCV Foundation | Computer vision library |
| Facial Recognition Systems | Various companies | Security and authentication |
AI systems capable of generating images, audio, proteins or structures instead of text.
| AI System | Developer | Use |
|---|---|---|
| Stable Diffusion | Stability AI | AI image generation |
| Midjourney | Midjourney | Artistic image generation |
| AlphaFold | Google DeepMind | Protein structure prediction |
| Latent Consistency Models (LCMs) | Research community | Fast image generation |
AlphaFold revolutionised biology by accurately predicting 3D protein structures.
AI systems used for forecasting future outcomes using numerical and statistical data.
| AI System | Application |
|---|---|
| Recommendation Engines | Netflix, Amazon, YouTube |
| ARIMA Models | Time-series forecasting |
| Prophet Models | Financial forecasting |
| Gradient Boosting Models (GBMs) | Classification and prediction |
AI systems that learn through trial-and-error interactions with environments to maximise rewards.
| AI System | Developer | Achievement |
|---|---|---|
| AlphaGo | Google DeepMind | Defeated world Go champion |
| AlphaZero | Google DeepMind | Self-learning game AI |
| Robotics Control Systems | Various companies | Autonomous robots |
Models focused on language understanding rather than text generation.
| Model | Developer | Use |
|---|---|---|
| BERT | Search understanding | |
| RoBERTa | Meta AI | NLP tasks |
| DeBERTa | Microsoft AI | Semantic analysis |
Compact AI models designed to run efficiently on smaller hardware.
| Model | Developer |
|---|---|
| Phi-3 / Phi-4 | Microsoft AI |
| Gemma |
| AI System | UPSC Relevance |
|---|---|
| AlphaFold | Biotechnology |
| YOLO | Computer vision |
| AlphaGo | Reinforcement learning |
| Stable Diffusion | Generative AI |
| BERT | NLP |
| Random Forest | Machine learning |
| OpenCV | Image processing |