Human visual system
The human visual system is the part of the human nervous system that detects light and turns it into the perception of images. It includes the eyes, which gather and focus light onto the retina, and the neural pathways that carry signals from the retina through the optic nerve to the visual cortex of the brain.[1] Its physical limits, the size of its field of view, how finely it resolves detail, how it judges depth, and how it perceives motion, set the design targets for virtual reality (VR) and augmented reality (AR) hardware. A head-mounted display does not need to exceed what the eye can perceive, and falling short of it produces the visible artifacts that VR engineering works to remove.[2]
Several properties of the visual system are referenced directly in VR and AR specifications and research: the roughly 200 degree horizontal field of view, the concentration of fine detail in a small central region called the fovea, the angular resolution limit of about one arcminute per line, the link between eye convergence and lens focus, and the way the eye smears a held image during motion. These give rise to the design problems of matching display field of view, achieving so-called retinal resolution, foveated rendering, the vergence-accommodation conflict, and low-persistence displays.
Anatomy and signal path
Light enters the eye through the cornea and the pupil, is focused by the lens, and forms an image on the retina at the back of the eye. The retina contains two classes of photoreceptor cell: rods and cones. A single human retina holds on the order of 6 to 7 million cones and roughly 120 million rods, so rods outnumber cones by about 20 to 1.[3] Cones operate in bright light and provide color vision and fine detail; rods are far more sensitive and dominate in dim light but do not distinguish color and have lower spatial resolution.[4]
Human color vision is trichromatic. Cones come in three types, designated S, M and L for their short-, medium- and long-wavelength sensitivity, each carrying a different photopigment (opsin). The brain compares the relative responses of the three cone types to perceive color.[5] The point on the retina where the optic nerve fibers leave the eye, the optic disc, contains no photoreceptors and produces a blind spot in each eye's field; the brain normally fills in this gap so it is not consciously noticed.[6]
Field of view
The human visual field is much wider horizontally than vertically and is not uniform across its extent. The total horizontal field, using both eyes, spans roughly 180 to 200 degrees. A single eye covers about 135 to 160 degrees horizontally, and the central region where the two eyes overlap, the binocular field, is about 114 degrees wide. The vertical field is about 135 degrees. These figures vary between sources and individuals.[7][8]
The binocular overlap matters for VR because it is the region in which the brain can compare the two slightly different eye images to extract depth, a process called stereopsis. Outside the overlap, on the far left and right, vision is monocular. Peripheral vision is low in detail and largely insensitive to color, but it is sensitive to motion and contributes to the sense of presence and spatial awareness.[7][9]
| Measure | Approximate value |
|---|---|
| Horizontal, both eyes | 180 to 200 degrees |
| Horizontal, single eye | 135 to 160 degrees |
| Binocular overlap (stereo region) | about 114 degrees |
| Vertical | about 135 degrees |
Visual acuity and the fovea
Fine detail is not spread evenly across the retina. Cones are packed most densely at the center of the fovea, where peak density averages roughly 199,000 cones per square millimeter (with wide variation between individuals), and acuity is highest there.[10] Acuity drops off quickly with eccentricity, the angular distance away from the point of fixation. There is already a measurable loss in acuity about 5 arcminutes from the center of fixation, and a roughly 25 percent loss by 10 arcminutes (one sixth of a degree).[4] The small region of sharpest sight covers only about a 5 degree central circle of the visual field.[7]
The standard for normal vision, 20/20 (6/6), corresponds to a minimum angle of resolution of one arcminute: one stroke of a letter subtends one minute of arc at the eye. The peak resolving capacity of the eye under good conditions is about 60 cycles per degree, equivalent to one arcminute per cycle.[4] The theoretical limit set by cone spacing alone is higher, on the order of 150 cycles per degree, but optical imperfections of the eye keep everyday performance below that.[4]
In 2025, researchers at the University of Cambridge and Meta Reality Labs measured the practical resolution limit of foveal vision more directly. Using a sliding high-resolution display, they reported a limit of about 94 pixels per degree for achromatic (greyscale) detail and about 89 pixels per degree for red-green color detail, with one participant reaching 120 pixels per degree. Yellow-violet detail was resolved at a lower limit of about 53 pixels per degree. The work was published in Nature Communications.[11][12] These figures revise upward the round number of 60 pixels per degree that the display industry had long treated as the limit of human perception.[13]
Depth perception
The visual system combines several cues to judge distance. Binocular cues come from the two eyes: stereopsis uses the disparity between the slightly different images on each retina, and vergence uses the inward rotation of the eyes needed to point both at the same near object. Monocular cues, available to a single eye, include occlusion (a near object hiding a far one), relative size, motion parallax, and the focus adjustment of the lens, called accommodation.[1][14]
In natural viewing, vergence and accommodation are coupled: when the eyes converge on a near object, the lenses focus to the same distance, and the two cues agree. This coupling is central to one of the main comfort problems in stereoscopic VR and AR, described below.[14]
Motion perception and persistence
The visual system does not sample the world in discrete frames, but it does have temporal limits. A held image leaves a brief afterimage on the retina, and the eye integrates light over a short interval rather than instantaneously. When the eye tracks a moving object (smooth pursuit) or the head turns, an image that stays lit and static for the duration of a display frame is dragged across the retina relative to where the eye is pointing, which the brain perceives as motion blur and judder.[15] This effect is the reason VR displays use short, strobed illumination, covered in the VR and AR section.
Role in virtual reality and augmented reality
Most of the perceptual targets and failure modes of VR and AR hardware are stated directly in terms of the human visual system. The eye's properties set both the goal (deliver an image the eye accepts as real) and the constraints (the eye notices any mismatch in resolution, depth, latency, or motion).
Field of view matching
Because human binocular vision spans roughly 180 to 200 degrees horizontally, a headset that covers only a narrow band leaves a visible black border and reduces immersion. Consumer VR headsets typically cover around 90 to 110 degrees horizontally, well short of the full human field, and widening it is a recurring design goal that trades against lens size, weight and rendering cost.[2][7] AR optical combiners generally have a smaller field of view than VR, so virtual content occupies only a central window of the user's sight.[2]
Retinal resolution and pixels per degree
The relevant resolution metric for a headset is not total pixel count but angular pixel density, measured in pixels per degree (PPD), because the display sits a fixed angular distance from the eye through a lens. The point at which adding pixels brings no perceptible benefit is called retinal resolution or eye-limiting resolution. It was long quoted as about 60 PPD, matching the one-arcminute acuity figure, which corresponds to one pixel per arcminute.[2] The 2025 Cambridge and Meta study showed the true limit for central vision is higher, roughly 94 PPD for greyscale detail, so headsets aiming to be visually indistinguishable from reality have a higher bar than previously assumed.[11][13] Early devices were far below the target: the original Oculus DK1 reached only about 7 PPD and the first-generation HTC Vive about 11 PPD, which is why the gap between adjacent pixels was visible as the screen door effect.[2] Modern consumer headsets reach roughly the low-to-mid 20s in peak PPD, still below the eye's limit.[2]
Foveated rendering
Because acuity is concentrated in the central few degrees and falls off steeply toward the periphery, a VR system can render the area the eye is looking at in full detail and the periphery at lower detail without the user noticing, saving large amounts of rendering work. This technique is foveated rendering. The approach was demonstrated by Brian Guenter and colleagues at Microsoft Research in 2012, who modeled the eye's minimum resolvable angular size as increasing linearly with eccentricity and rendered three nested layers (a sharp inner layer plus two coarser, larger peripheral layers) around the gaze point. From a user study they fitted an acuity falloff slope of about 1.32 to 1.65 arcminutes per degree of eccentricity, and projected that on a wide, sharp future display (70 degree field at foveal acuity) foveated rendering could cut rendering cost by roughly 100 times.[7] Gaze-contingent foveated rendering depends on fast, accurate eye tracking to follow the fovea, since the high-detail region must move with the eye; excessive latency makes the change in detail visible.[7][16]
Vergence-accommodation conflict
In natural sight the eyes' convergence and the lenses' focus track the same distance together. A stereoscopic headset breaks this link: it shows two offset images so the eyes converge to a virtual depth that changes with the scene, while the actual light comes from a fixed display plane (set by the optics to a fixed focal distance, often around 1.3 to 2 meters), so the lenses must keep focusing there. The result is the vergence-accommodation conflict, in which the convergence cue and the focus cue disagree.[14][2] A controlled study by David Hoffman, Ahna Girshick, Kurt Akeley and Martin Banks, published in the Journal of Vision in 2008, showed that when focus cues are made consistent with the depicted depth, viewers identify stereoscopic stimuli faster, show better stereoacuity in time-limited tasks, see less distortion of perceived depth, and report less fatigue and discomfort; conflicting cues degrade performance and cause visual fatigue.[17] This conflict is a known source of eye strain in current VR and AR headsets and motivates research into varifocal and light-field displays that can present correct focus cues.[17][14]
Low-persistence displays
The eye's tendency to smear a held image during motion drives the use of low-persistence displays in VR. Persistence is the fraction of each frame that the panel is actually lit; in a full-persistence (sample-and-hold) display the same image stays on for the whole frame, so when the user turns their head or tracks a moving object the static frame is dragged across the retina and appears blurred. Eliminating that blur purely by raising the frame rate would require on the order of 1000 Hz, which is impractical, so VR panels instead flash each frame for only a few milliseconds and stay black the rest of the frame, which gives a sharp image during motion at a normal refresh rate.[15][18] The rule of thumb is that one millisecond of persistence produces one pixel of motion blur per 1000 pixels per second of retinal image motion.[15] Low persistence was investigated for VR by Michael Abrash at Valve and, in parallel, by Blur Busters working with Oculus, and it first shipped in an Oculus VR headset with the Oculus Rift Development Kit 2 (DK2), a developer kit released in 2014, using a strobed low-persistence OLED panel.[15][18]
Latency and interpupillary distance
Two further visual factors shape headset design. First, the visual and vestibular systems are sensitive to the delay between head movement and the corresponding image update, the motion-to-photon latency; if the image lags the head, the mismatch contributes to discomfort and motion sickness, so VR systems target very low end-to-end latency. Second, the spacing between the two eyes, the interpupillary distance (IPD), varies between people, averaging about 63 millimeters in adults with a typical range from roughly 55 to 70 millimeters. A headset's lenses and rendered eye images must be aligned to the user's IPD; a mismatch causes blur, eye strain and distorted depth, which is why many headsets provide mechanical or software IPD adjustment, commonly covering about 58 to 72 millimeters.[19][20]
References
- ↑ 1.0 1.1 "Visual System: The Eye". https://openbooks.lib.msu.edu/introneuroscience1/chapter/vision-the-retina/.
- ↑ 2.0 2.1 2.2 2.3 2.4 2.5 2.6 "Understanding Pixel Density and Retinal Resolution, and Why It's Important for VR and AR Headsets". 2017-04-21. https://www.roadtovr.com/understanding-pixel-density-retinal-resolution-and-why-its-important-for-vr-and-ar-headsets/.
- ↑ "Retina". https://en.wikipedia.org/wiki/Retina.
- ↑ 4.0 4.1 4.2 4.3 Kalloniatis, M. and Luu, C.. "Visual Acuity". https://www.webvision.pitt.edu/book/part-viii-psychophysics-of-vision/visual-acuity/.
- ↑ "Cone cell". https://en.wikipedia.org/wiki/Cone_cell.
- ↑ "Blind spot (vision)". https://en.wikipedia.org/wiki/Blind_spot_(vision).
- ↑ 7.0 7.1 7.2 7.3 7.4 7.5 Guenter, B., Finch, M., Drucker, S., Tan, D. and Snyder, J. (2012). "Foveated 3D Graphics". ACM Transactions on Graphics (SIGGRAPH Asia). https://www.microsoft.com/en-us/research/wp-content/uploads/2012/11/foveated_final15.pdf.
- ↑ "The FOV of human eyes is approximately 135 vertically and 200 horizontally". https://www.researchgate.net/figure/The-FOV-of-human-eyes-is-approximately-135-vertically-and-200-horizontally-including_fig3_263161973.
- ↑ "What Is a Human's Field of View (FOV)?". https://scienceinsights.org/what-is-a-humans-field-of-view-fov/.
- ↑ Curcio, C. A., Sloan, K. R., Kalina, R. E. and Hendrickson, A. E.(1990). "Human photoreceptor topography".{Template:Journal. 292(4). https://pubmed.ncbi.nlm.nih.gov/2324310/. Retrieved 2026-06-21.
- ↑ 11.0 11.1 Ashraf, M., Chapiro, A. and Mantiuk, R. K.(2025). "Resolution limit of the eye: how many pixels can we see?".{Template:Journal. https://www.nature.com/articles/s41467-025-64679-2. Retrieved 2026-06-21.
- ↑ "Researchers measure the resolution limit of the human eye". 2025-10-27. https://www.cst.cam.ac.uk/news/researchers-measure-resolution-limit-human-eye.
- ↑ 13.0 13.1 "Cambridge and Meta Researchers Confirm Retinal Resolution Is Far Higher Than Thought". 2025-10-27. https://www.uploadvr.com/cambridge-meta-researchers-prove-retinal-resolution-far-higher-than-60-ppd/.
- ↑ 14.0 14.1 14.2 14.3 "The geometry of the vergence-accommodation conflict in mixed reality systems". 2024. https://pmc.ncbi.nlm.nih.gov/articles/PMC11371868/.
- ↑ 15.0 15.1 15.2 15.3 "How Blur Busters Convinced Oculus Rift To Go Low Persistence". https://blurbusters.com/how-blur-busters-convinced-oculus-rift-to-go-low-persistence/.
- ↑ Albert, R., Patney, A., Luebke, D. and Kim, J. (2017). "Latency Requirements for Foveated Rendering in Virtual Reality". ACM Transactions on Applied Perception. https://research.nvidia.com/sites/default/files/pubs/2017-09_Latency-Requirements-for/a25-albert.pdf.
- ↑ 17.0 17.1 Hoffman, D. M., Girshick, A. R., Akeley, K. and Banks, M. S.(2008). "Vergence-accommodation conflicts hinder visual performance and cause visual fatigue".{Template:Journal. 8(3). https://pubmed.ncbi.nlm.nih.gov/18484839/. Retrieved 2026-06-21.
- ↑ 18.0 18.1 "Oculus Rift S Has Lower Pixel Persistence Than Original, Meaning Less Motion Blur". https://www.uploadvr.com/rift-s-low-persistence/.
- ↑ "How IPD Affects Your VR Experience and How to Adjust It". https://maeckervr.com/blogs/news/how-ipd-affects-your-vr-experience-and-how-to-adjust-it.
- ↑ "Overview of Interpupillary Distance Adjustment Technologies in VR Headsets". https://counterpointresearch.com/en/reports/overview-of-interpupillary-distance-adjustment-technologies-in-vr-headsets.