Vítor Fonseca

Vítor Fonseca

I am a final-year BSc student in Aerospace Engineering at Instituto Superior Técnico (IST Lisboa), where I focus on robotic systems. I am currently a team member of SocRob@Home at the Institute for Systems and Robotics (ISR Lisboa), and I manage the Autonomous Systems department of Técnico Fuel Cell (TFC), a student-led association. My research at ISR focuses on Embodied AI using Visual Language Models (VLMs) and 3D point cloud reconstruction.

Semantic Reconstruction

A ROS 2 package for semantic mapping in domestic environments. The pipeline starts in a Gazebo-simulated environment. When the camera is oriented toward the desired surfaces, object detection and segmentation tools are used (currently YOLO with the COCO dataset, soon to be SAM 3). Based on the 2D segmented images, the system extracts 3D point clouds for each detected class. These point clouds are filtered and reconstructed into 3D meshes using Open3D. The meshes are saved in map coordinates and published as a MarkerArray for visualization in RViz. Different colors are assigned to different object classes in order to distinguish them semantically. This project is still in development for perception and manipulation tasks for RoboCup 2026. Click on 'View Project' to see the repository.

View project

AV Gazebo Simulation

Gazebo simulation designed to model sensors, actuators, and the track in order to validate the car's autonomy stack. The work included URDF, Gazebo controllers, a detection model, architecture optimization, sensor design, and calibration.

More Info

Manipulator AMR

Personal project focused on designing an Autonomous Mobile Robot with manipulation capabilities. The goal is to expand my knowledge of reinforcement learning techniques for autonomous driving and manipulation.

View project

VLM ROS2 bridge and optimization

Visual Language Model (Chatrex) ROS 2 bridge. This project demonstrates object detection with real-time performance. The prompt is "Please detect chairs; sofas; sitting people in this image. List each one with a different object index." As shown, the model successfully outputs the bounding boxes for the requested prompt in under 1 second. The process uses cloud AI to run the model and then sends the inference results to the robot running ROS 2.

More Info

Drone Prototyping

Quadrotor design, from avionics architecture to CAD and manufacturing. The drone was designed for autonomous rescue applications, detecting a person and bridging communication with support services.

More Info