Repository logo

Design and implementation of a portable machine vision system for real-time object detection and auditory feedback

dc.contributor.advisorPillay, N
dc.contributor.advisorSingh, N
dc.contributor.authorSivate, Themba Mthembane
dc.date.accessioned2026-06-22T10:11:07Z
dc.date.available2026-06-22T10:11:07Z
dc.date.issued2024
dc.descriptionSubmitted in fulfilment of the requirements for the degree Master in Engineering: Computer Engineering, Durban University of Technology, Durban, South Africa, 2024.
dc.description.abstractA primary challenge faced by individuals with visual impairments is the difficulty or inability to perform object identification. While conventional aids such as magnifying spectacles may assist with near-field vision, individuals with reduced or absent sight often rely extensively on tactile exploration for object recognition. This reliance presents significant navigational challenges, particularly in dynamic environments such as roadways, where the increased risk of accidents is a major concern. To address this, this research proposes the development of a portable machine vision system designed to provide real-time auditory feedback regarding detected proximal objects, thereby assisting individuals with visual impairments in navigation. The design of the system prioritizes portability, reliability, modularity, and unobtrusiveness during typical operation. The hardware implementation of the proposed system consists of three key elements: a Single Board Computer (SBC), a wireless camera, and a Bluetooth-enabled earpiece. The experimental results demonstrate that the proposed system is capable of delivering real- time audio feedback of detected objects to visually impaired individuals. The system was evaluated in a real-life environment in both acceptable and poor lighting conditions. The efficacy of the proposed system in well-lit environments resulted in an average detection rate of 87.65%. However, in low-light scenes an average detection rate of 51% was observed because of the low image resolution. The minimum observed delay was 4.9 seconds, while the maximum was 10.2 seconds. This latency encompasses the duration required for image capture, processing, and audio translation. The latency can be mitigated by incorporating an integrated circuit with a dedicated Graphical Processing Unit (GPU), which is more proficient in handling machine learning and video processing tasks.
dc.description.levelM
dc.format.extent122 p
dc.identifier.doihttps://hdl.handle.net/10321/6409
dc.identifier.urihttps://hdl.handle.net/10321/6409
dc.language.isoen
dc.subjectComputer vision
dc.subjectObject detection
dc.subjectAssistive technology
dc.subjectVisual impairment
dc.subjectBlind navigation
dc.subjectMachine learning
dc.subjectEmbedded systems
dc.subjectReal-time processing
dc.subjectAudio feedback
dc.subjectWearable computing
dc.subjectArtificial intelligence
dc.subjectAccessibility technology
dc.subject.lcshComputer vision
dc.subject.lcshMachine learning
dc.subject.lcshArtificial intelligence
dc.subject.lcshComputers and people with disabilities
dc.subject.lcshSelf-help devices for people with disabilities
dc.titleDesign and implementation of a portable machine vision system for real-time object detection and auditory feedback
dc.typeThesis
local.sdgSDG03
local.sdgSDG09
local.sdgSDG10
local.sdgSDG11

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sivate_TM_2026.pdf
Size:
2.61 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: