Quick Summary
Apple has a new technology which could revitalise the Vision Pro.
It can produce a 3D render from a single 2D image.
While high-profile tech releases are fairly common in this day and age, those which forge ahead into new frontiers are less so. For many, the slew of mixed reality and virtual reality headsets is the greatest indicator of a new era.
It has now been a few years since Apple launched its flagship in this area. The Vision Pro remains one of the most costly devices on the market, and some have suggested it doesn't quite live up to the price tag.
For me, a new piece of software from Apple might be the ticket to unlocking a real-world use case for the device. It's a new open-source model which can turn a 2D photo into a 3D scene.
Dubbed SHARP, the model predicts what a 3D render of the scene would look like, based on viewpoints within the image. Without getting too far into the nitty-gritty, it estimates the depth of certain elements of the image, and uses them as waypoints within the 3D scene.
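For the curious, here is a rough sketch of why depth matters so much. It is not Apple's code and not how SHARP itself works under the hood (the model outputs a 3D Gaussian representation). It only illustrates the general idea: once each pixel has a depth value, it can be lifted into 3D, and nearby objects will shift more than distant ones when the viewpoint moves. The function names, the simple pinhole camera model and the focal length value are all assumptions made for illustration.

```python
from typing import List, Tuple

Point3D = Tuple[float, float, float]
Point2D = Tuple[float, float]


def unproject_depth(depth: List[List[float]], focal_px: float) -> List[Point3D]:
    """Lift every pixel of a depth map to a 3D point (assumed pinhole camera)."""
    height = len(depth)
    width = len(depth[0])
    # Assume the optical centre sits in the middle of the image.
    cx = (width - 1) / 2.0
    cy = (height - 1) / 2.0
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            x = (u - cx) * z / focal_px
            y = (v - cy) * z / focal_px
            points.append((x, y, z))
    return points


def shift_viewpoint(points: List[Point3D], dx: float) -> List[Point3D]:
    """Move the camera dx units to the right, so points shift left relative to it."""
    return [(x - dx, y, z) for x, y, z in points]


def project(points: List[Point3D], focal_px: float, cx: float, cy: float) -> List[Point2D]:
    """Project 3D points back onto the image plane."""
    return [(focal_px * x / z + cx, focal_px * y / z + cy) for x, y, z in points]


# A tiny one-row "image": the left pixel is near (depth 1), the right is far (depth 4).
pts = unproject_depth([[1.0, 4.0]], focal_px=2.0)
moved = project(shift_viewpoint(pts, dx=0.5), focal_px=2.0, cx=0.5, cy=0.0)
print(moved)
```

In this toy case the near pixel slides a full pixel across the frame while the far one moves by a quarter of a pixel. That difference is the parallax which makes a flat photo feel like a scene you can lean into.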
The big difference for Apple's technology is that it can produce a full 3D render from a single image. Other tools of this nature require hundreds of images of the same scene in a bid to produce a usable rendering.
"New paper from Apple - Sharp Monocular View Synthesis in Less than a Second. Mescheder et al. @ Apple just released a very impressive paper (congrats! 🎉🥳). You give it an image and it generates a really great looking 3d Gaussian representation. Uses depth pro. It's really good.…" (pic.twitter.com/XSZCZA8iio, December 16, 2025)
Having a one-shot system means that users can convert many images with ease, without having to capture a scene from multiple angles first.
When I first read about this technology, I was a little blasé. Sure, it sounds fun, but who would actually use it?
Then I saw footage of the models on an Apple Vision Pro headset, and suddenly it made sense. Images captured as a snapshot of a moment in time could now be interacted with, and moved through.
While there are definitely limitations in the design – it won't generate beyond the borders of the image, for example – the overall effect is solid, and adds a new layer of depth to old images.

Sam is an award-winning journalist with over six years of experience across print and digital media. As T3’s Senior Staff Writer, Sam covers everything from new phones and EVs to luxury watches and fragrances. Working across a range of different social media platforms alongside his written work, Sam is a familiar face for fans of T3. When he’s not reviewing snazzy products or hunting for stellar deals, Sam enjoys football, analog photography and writing music.