Foundation Models
Research papers, repositories, and articles about foundation models
Next-Embedding Prediction Makes Strong Vision Learners
Instead of predicting pixels or patches, this method predicts the next embedding in a learned space. Vision practitioners can use it in pretraining to get more out of ImageNet-scale data.
Sihan Xu, Ziqiao Ma
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Depth Any Panoramas is a single model for depth estimation on 360° indoor and outdoor scenes. Robotics and AR teams can reuse it instead of training a separate depth network for each dataset.
Xin Lin, Meixi Song