Rokid Vision 2

From VR & AR Wiki


The Rokid Vision 2 was a cancelled augmented reality head-mounted display developed by Rokid, a Chinese technology company specializing in AR and MR devices. Unveiled on January 15, 2021, at the Rokid Open Day 2021 event in Hangzhou, China, the Vision 2 represented a significant redesign of the company's AR glasses lineup with a foldable, eyeglasses-like form factor.[1]

Rokid Vision 2
Basic Info
VR/AR: Augmented Reality, Mixed Reality
Type: Head-mounted display
Subtype: AR Glasses (Tethered)
Platform: Android, Windows, Gaming consoles
Creator: Rokid
Price: N/A (cancelled before release)
Website: https://global.rokid.com
System
CPU: Host device dependent
GPU: Host device dependent
Display
Display: Diffractive optical waveguide
Tracking
Tracking: 6DoF SLAM (single RGB camera)
Hand Tracking: Yes (gesture recognition)
Audio
Microphone: Dual microphones
Connectivity
Connectivity: 4G/5G compatible (via host device)
Device
Input: Voice control, head tracking, gestures

History and Development

Company Background

Rokid is a Chinese technology company headquartered in Hangzhou that focuses on augmented reality and artificial intelligence products. The company has developed multiple generations of AR glasses for both consumer and enterprise markets.

Rokid Open Day 2021

On January 15, 2021, Rokid officially unveiled the Vision 2 at the Rokid Open Day 2021 online event hosted at the company's headquarters in Hangzhou, China. The announcement showcased the new hardware design and introduced several new XR applications developed specifically for the device.[2]

Cancellation

Despite the detailed announcement and demonstrated applications, the Rokid Vision 2 was cancelled before reaching commercial release. Rokid subsequently focused on other products in its AR glasses lineup, including the Rokid Air and Rokid Max series.

Design and Hardware

Form Factor

The Rokid Vision 2 was designed with a foldable form factor resembling ordinary eyeglasses, prioritizing portability and everyday wearability. The glasses could be folded for easy storage and transport.

Display Technology

The device utilized diffractive optical waveguide technology, which Rokid described as "world's leading." Key optical specifications included:

  • Light Transmittance: 85%, allowing users to see the real world clearly
  • Field of View: 40° diagonal
  • Equivalent Viewing Experience: Comparable to viewing a 120-inch TV from 4 meters away
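
The field-of-view and "equivalent screen" figures above can be cross-checked with basic trigonometry. The following is a minimal Python sketch, assuming a flat screen viewed head-on from its center and comparing diagonal angles only. The function names are illustrative and do not come from Rokid.

```python
import math

INCH_TO_M = 0.0254  # exact conversion factor from inches to meters


def angular_size_deg(extent_m, distance_m):
    """Angle (degrees) subtended by a flat extent viewed head-on from its center."""
    return math.degrees(2 * math.atan((extent_m / 2) / distance_m))


def extent_for_angle_m(angle_deg, distance_m):
    """Extent (meters) that subtends the given angle at the given distance."""
    return 2 * distance_m * math.tan(math.radians(angle_deg) / 2)


# Angle subtended by the diagonal of a 120-inch screen seen from 4 meters.
tv_diagonal_m = 120 * INCH_TO_M
tv_angle_deg = angular_size_deg(tv_diagonal_m, 4.0)

# Screen diagonal (inches) that a 40-degree diagonal field of view covers at 4 meters.
fov_diagonal_in = extent_for_angle_m(40.0, 4.0) / INCH_TO_M

print(f"120-inch screen at 4 m subtends {tv_angle_deg:.1f} degrees diagonally")
print(f"40-degree diagonal FOV at 4 m covers about {fov_diagonal_in:.0f} inches")
```

Under these assumptions, a 120-inch diagonal at 4 meters subtends roughly 42°, and a 40° diagonal field of view covers a little under 115 inches at that distance, so the two stated figures are approximately consistent.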

Tethered Design

To achieve the lightweight, glasses-like form factor, the Vision 2 did not include onboard computing or power. The device required connection to external host devices for processing and power.

Audio System

The glasses featured dual integrated speakers and dual microphones, enabling hands-free voice control and audio feedback without requiring external audio devices.

Connectivity

The Rokid Vision 2 was designed to extend existing user interfaces by tethering to various devices:

  • Smartphones
  • Laptops
  • Gaming consoles
  • Tablets

The glasses had no cellular modem of their own; 4G and 5G network connectivity was provided by the tethered host device.

Tracking and Input

Spatial Tracking

The Vision 2 incorporated 6DoF SLAM (Simultaneous Localization and Mapping) technology based on a single RGB camera, enabling users to move freely within virtual spaces while the system tracked their position.
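
To illustrate what "6DoF" means in practice, a tracked pose combines three translational degrees of freedom (position) with three rotational ones (orientation). The sketch below is a generic Python illustration, not Rokid's implementation or SDK API. The class name `Pose6DoF`, the quaternion representation, and the axis convention (y up, -z forward, meters) are all assumptions made for the example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Pose6DoF:
    """Hypothetical headset pose: position in meters plus a unit quaternion."""
    x: float
    y: float
    z: float
    qw: float
    qx: float
    qy: float
    qz: float

    def transform_point(self, point):
        """Map a point from the headset frame into the world frame."""
        px, py, pz = point
        w, x, y, z = self.qw, self.qx, self.qy, self.qz
        # Rotate with v' = v + 2w(q x v) + 2 q x (q x v), where q is the vector part.
        cx, cy, cz = y * pz - z * py, z * px - x * pz, x * py - y * px
        ccx, ccy, ccz = y * cz - z * cy, z * cx - x * cz, x * cy - y * cx
        rx = px + 2 * (w * cx + ccx)
        ry = py + 2 * (w * cy + ccy)
        rz = pz + 2 * (w * cz + ccz)
        # Then translate by the headset position.
        return (rx + self.x, ry + self.y, rz + self.z)


# Example: headset at (1.0, 1.6, 0.0), turned 90 degrees about the vertical (y) axis.
half_angle = math.radians(90) / 2
pose = Pose6DoF(1.0, 1.6, 0.0, math.cos(half_angle), 0.0, math.sin(half_angle), 0.0)

# A point one meter in front of the headset (-z in the assumed convention).
world_point = pose.transform_point((0.0, 0.0, -1.0))
print(world_point)
```

A 3DoF system would track only the quaternion part; adding the position is what lets a user walk through a scene rather than just look around it.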

Multimodal Interaction

The glasses supported multiple input methods:

  • Voice commands
  • Head tracking and control
  • Gesture recognition

Software and Applications

Rokid SXR SDK

Rokid released the SXR SDK alongside the Vision 2, providing developers with technologies and services including:

  • Multimodal interaction
  • Stereo rendering
  • Visual engine
  • Multimedia engine

The SDK supported development on Android and Unity platforms, and was compatible with hardware platforms from HiSilicon, Amlogic, and Qualcomm.

Pre-loaded Applications

Rokid developed several proprietary applications for the Vision 2:

Fantasy World

A 360-degree immersive space application featuring multimodal interaction through voice, head control, and gestures.

Holographic Cinema

An immersive experience modeled on a 4D IMAX theater, supporting 2D, 3D, and 360-degree panoramic video content.

Virtual City

An application utilizing the 6DoF SLAM technology to enable free movement within a virtual city environment.

Target Markets

Enterprise

Rokid positioned the Vision 2 for adoption in multiple industries:

  • Cultural tourism
  • Museums and exhibitions
  • Medical applications
  • Industrial use cases

Consumer

The device also targeted consumer applications:

  • Gaming
  • Media consumption
  • Multi-screen desktop displays for remote work

Specifications

Specification | Details
Display Type | Diffractive optical waveguide
Field of View | 40° diagonal
Light Transmittance | 85%
Tracking | 6DoF SLAM
Camera | Single RGB
Audio | Dual speakers, dual microphones
Input Methods | Voice, head tracking, gestures
Connectivity | Tethered (phones, laptops, consoles, tablets)
Network Support | 4G/5G compatible (via host)

References