Computer Vision

Teaching machines to interpret images and video — from recognition to generation.

Computer Vision is one of the core areas on the AI University map of AI. Explore the diagram, then dive into each topic; every subtopic will grow into its own deep dive over time.

flowchart TB
  IMG[/Image or Video/] --> CL[Classify]
  IMG --> DET[Detect]
  IMG --> SEG[Segment]
  IMG --> GENV[Generate]
  IMG --> D3[3D / Scene]

Key topics

Image classification

Assign labels to images; breakthroughs on the ImageNet benchmark for this task kicked off the deep-learning era.
Object detection & segmentation

Locate objects with bounding boxes (detection) and outline them with pixel-level masks (segmentation).
Image & video generation

Create visual content with diffusion and transformer models.
3D & scene understanding

Depth estimation, pose estimation, neural radiance fields, and reconstruction of the world from images.
Multimodal vision-language

Models that jointly understand images and text (captioning, visual question answering).
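
To make the bounding boxes and pixel masks mentioned under object detection & segmentation more concrete, here is a minimal Python sketch of intersection over union (IoU), a common way to score how well a predicted box or mask overlaps a reference one. The function names `box_iou` and `mask_iou`, the `(x_min, y_min, x_max, y_max)` box layout, and the nested-list mask format are assumptions chosen for illustration; they do not come from any particular library.

```python
from typing import List, Tuple

# Assumed layout for illustration: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]
# Assumed layout for illustration: rows of 0/1 pixel values.
Mask = List[List[int]]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned bounding boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix_min, iy_min = max(a[0], b[0]), max(a[1], b[1])
    ix_max, iy_max = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection over union of two same-sized binary pixel masks."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pixel_a, pixel_b in zip(row_a, row_b):
            if pixel_a and pixel_b:
                inter += 1
            if pixel_a or pixel_b:
                union += 1
    return inter / union if union else 0.0


if __name__ == "__main__":
    predicted_box = (0.0, 0.0, 2.0, 2.0)
    reference_box = (1.0, 1.0, 3.0, 3.0)
    print(box_iou(predicted_box, reference_box))  # overlap 1, union 7

    predicted_mask = [[1, 1], [0, 0]]
    reference_mask = [[1, 0], [1, 0]]
    print(mask_iou(predicted_mask, reference_mask))  # overlap 1, union 3
```

A box describes an object with four numbers, while a mask marks every pixel, which is why the same overlap idea is computed over areas in one case and over pixel counts in the other.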

Related areas

Deep Learning · Generative AI · Robotics & Embodied AI

Learn this properly

Want hands-on training in computer vision? Explore AI University courses and AI School camps for kids.
