Computer Vision
Teaching machines to interpret images and video — from recognition to generation.
Computer Vision is one of the core areas in the AI University map of AI. Explore the diagram, then dive into each topic — every subtopic grows into its own deep-dive over time.
flowchart TB
    IMG[/Image or Video/] --> CL[Classify]
    IMG --> DET[Detect]
    IMG --> SEG[Segment]
    IMG --> GENV[Generate]
    IMG --> D3[3D / Scene]
Key topics
- Image classification: Assign labels to images; the task that kicked off the deep-learning era (ImageNet).
- Object detection & segmentation: Locate and outline objects (bounding boxes, pixel masks).
- Image & video generation: Create visual content with diffusion and transformer models.
- 3D & scene understanding: Depth, pose, neural radiance fields, and reconstructing the world from images.
- Multimodal vision-language: Models that jointly understand images and text (captioning, visual question answering).
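To make the two output types named under detection and segmentation concrete, here is a minimal sketch in plain Python. It assumes a binary pixel mask stored as nested lists and boxes written as (x_min, y_min, x_max, y_max) with exclusive maximum edges. The helper names `mask_to_box` and `box_iou` are illustrative and not taken from any particular library. The sketch derives a bounding box from a mask, then scores a hypothetical predicted box against it with intersection over union (IoU), a common overlap measure for boxes.

```python
from typing import List, Tuple

# Assumed convention: (x_min, y_min, x_max, y_max), with max edges exclusive.
Box = Tuple[int, int, int, int]


def mask_to_box(mask: List[List[int]]) -> Box:
    """Return the tightest box around the nonzero pixels of a binary mask."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, value in enumerate(row) if value]
    if not rows:
        raise ValueError("mask contains no foreground pixels")
    return (min(cols), min(rows), max(cols) + 1, max(rows) + 1)


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0


# A toy 4x5 segmentation mask: 1 marks pixels that belong to the object.
mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
box = mask_to_box(mask)          # box enclosing every foreground pixel
predicted = (1, 1, 3, 3)         # hypothetical detector output
score = box_iou(box, predicted)  # overlap between the two boxes
print(box, round(score, 3))
```

A mask carries strictly more information than a box: the box can always be recovered from the mask, as above, but not the other way around.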
Related areas
Deep Learning · Generative AI · Robotics & Embodied AI
Learn this properly
Want hands-on training in computer vision? Explore AI University courses and AI School camps for kids.