{"id":35573,"date":"2026-03-26T15:32:00","date_gmt":"2026-03-26T15:32:00","guid":{"rendered":"https:\/\/www.tun.com\/home\/?p=35573"},"modified":"2026-03-26T19:31:58","modified_gmt":"2026-03-26T19:31:58","slug":"new-ai-model-helps-machines-better-understand-3d-world","status":"publish","type":"post","link":"https:\/\/www.tun.com\/home\/new-ai-model-helps-machines-better-understand-3d-world\/","title":{"rendered":"New AI Model Helps Machines Better Understand 3D World"},"content":{"rendered":"\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-uagb-blockquote uagb-block-e7eb3fc3 uagb-blockquote__skin-border uagb-blockquote__stack-img-none\"><blockquote class=\"uagb-blockquote\"><div class=\"uagb-blockquote__content\">A new AI model from Kaunas University of Technology helps computers move beyond simply \u201cseeing\u201d objects to understanding their meaning in real-world 3D scenes. 
The advance could make self-driving cars, drones and digital twins safer and more reliable.<\/div><footer><div class=\"uagb-blockquote__author-wrap uagb-blockquote__author-at-left\"><\/div><\/footer><\/blockquote><\/div>\n\n\n\n<div class=\"wp-block-group is-content-justification-space-between is-nowrap is-layout-flex wp-container-core-group-is-layout-b0ffac9c wp-block-group-is-layout-flex\"><div style=\"font-size:16px\" class=\"has-text-align-left wp-block-post-author\"><div class=\"wp-block-post-author__content\"><p class=\"wp-block-post-author__name\">The University Network<\/p><\/div><\/div>\n<\/div>\n<\/div><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Self-driving cars that can spot a partially hidden pedestrian at dusk. Drones that can safely weave through crowded city streets. 
Digital twins of entire cities that update in near real time.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A team at Kaunas University of Technology (KTU) in Lithuania has developed an artificial intelligence model that brings these scenarios closer to everyday reality by helping machines understand the 3D world more like humans do.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The new model tackles one of the toughest problems in computer vision: making sense of 3D point clouds, the millions of data points that laser sensors collect when they scan streets, forests or buildings.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">KTU professor Rytis Maskeli\u016bnas explained the basic idea behind this technology.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cImagine taking millions of precise laser measurements of a physical space, like a street, a forest, or an entire city, and stitching them together to create a detailed three-dimensional map made up of individual points. This is known as a 3D point cloud. The technology used to analyse it focuses on helping computers understand the shapes of objects in the map and interpret their context within the scene,\u201d Maskeli\u016bnas said in a news release.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These 3D maps already underpin many tools we use every day, from driver-assistance systems in modern cars to detailed digital models of cities used for urban planning and infrastructure monitoring.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">KTU researcher Sarmad Maqsood noted that most people are surrounded by this technology without realizing it. 
<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cAn average person regularly encounters the underlying 3D data and technologies similar to those described in our work without even realising it,\u201d Maqsood said in the news release.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Cars, cities and digital twins<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">In today\u2019s vehicles, sensors and 3D data help power features such as automatic emergency braking and adaptive cruise control. These systems need to distinguish between pedestrians, cyclists, vehicles and road edges, often in poor weather, low light or crowded environments.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Beyond transportation, 3D point clouds are used to build high-resolution digital models of urban areas. These models support so-called digital twins \u2014 virtual replicas of real-world environments that can be updated continuously to track changes in buildings, roads, vegetation and more.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">But teaching computers to read these complex 3D scenes is far from simple.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cComputers face significant difficulties in analysing 3D point clouds primarily because this data type is inherently irregular, unstructured, and massive,\u201d Maqsood added.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In a point cloud, nearby objects may be captured with many dense points, while distant ones are represented sparsely. Important elements such as pedestrians or small obstacles can appear far less frequently than dominant surfaces like roads or building facades. 
On top of that, real-world data is full of noise and occlusions, where objects block each other from view.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">All of this makes it hard for algorithms to reliably identify and label each point as part of a road, tree, vehicle, person or other object \u2014 especially when decisions need to be made in real time for safety-critical systems.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">A hybrid model that sees both detail and context<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">To overcome these challenges, the KTU team designed a model that blends several ways of analyzing 3D data into a single, unified system.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Traditional approaches often focus either on local details \u2014 the fine-grained shape of a curb or a car bumper \u2014 or on the global structure of a scene, such as the overall layout of a street. The new model is built to do both at once.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At its core is a transformer-based method, a type of AI architecture originally popularized in natural language processing. In this context, transformers help the system capture relationships across an entire 3D scene, rather than treating each region in isolation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">On top of that, the model includes mechanisms that deliberately emphasize rare but important features. That makes it better at handling imbalanced data, where small or less frequent objects might otherwise be overlooked.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Maskeli\u016bnas compared the challenge to sorting out a chaotic 3D jigsaw. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cImagine you have a massive, messy 3D puzzle made of millions of points that needs to be sorted into meaningful objects like roads, trees, and pedestrians. 
Our model acts like a highly intelligent and efficient puzzle-solver,\u201d he said.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">By learning how points relate to one another across the whole scene and by boosting the signal from underrepresented objects, the system improves detection of small, partially hidden or sparsely captured items that older methods might miss.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Seeing the person in the noise<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">This ability becomes crucial in real-world scenarios, such as an autonomous vehicle approaching an intersection at dusk. In that situation, a pedestrian might only appear as a handful of scattered points, partially obscured by a parked car or street furniture.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Instead of treating those few points as meaningless noise, the KTU model uses context to infer what is really there. It relates the sparse data to nearby structures like a pole, sidewalk or crosswalk and infers that those points likely belong to a person.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This contextual reasoning could have a direct impact on safety, according to Maskeli\u016bnas.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cInstead of missing this information, the model interprets it in context \u2013 relating sparse signals to surrounding elements such as a pole or a crosswalk \u2013 and identifies the presence of a person even when the data is incomplete. This ability to interpret context from limited information could significantly improve safety in autonomous systems,\u201d said Maskeli\u016bnas.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Crucially, the system is designed to be efficient as well as accurate. 
According to the researchers, it can process complex 3D scenes in just over two seconds per frame while maintaining high performance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Maqsood emphasized that the technical advance is not only about better segmentation \u2014 the task of assigning each point to a category like road, tree or pedestrian \u2014 but also about how the entire workflow is streamlined. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cBeyond segmentation accuracy, a key achievement is the demonstration of an efficient, unified pipeline,\u201d he said.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The model integrates compression and transmission into the process, allowing large-scale 3D data to be handled and shared in near real time without losing essential detail. That is important for applications where data must be sent between vehicles, drones, servers or city management systems.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">What comes next<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">While the work is immediately relevant to autonomous driving and smart cities, the researchers see many other potential uses.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Delivery drones could navigate cluttered, unpredictable environments more safely. Robots in search-and-rescue missions could better interpret collapsed structures or debris fields from sparse sensor data. 
Archaeologists might reconstruct ruins from limited scans, and forensic investigators could analyze subtle spatial details at crime or accident scenes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The same underlying capability \u2014 turning messy, incomplete 3D measurements into meaningful understanding \u2014 could also power more advanced augmented reality, where digital content is precisely anchored to complex real-world spaces.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As 3D sensing becomes cheaper and more widespread, from lidar-equipped phones to city-scale mapping projects, tools like the KTU model may become a key part of how machines perceive and manage the built and natural environment.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The team\u2019s study, <a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S2352938526000248\" target=\"_blank\" rel=\"noopener\" title=\"\">published<\/a> in the journal <em>Remote Sensing of Environment<\/em>, suggests that machines are moving beyond simply capturing the world in three dimensions. They are beginning to interpret it in ways that are closer to how people see and understand their surroundings.<\/p>\n\n\n\n<div style=\"height:10px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Source:<\/strong> <a href=\"https:\/\/en.ktu.edu\/news\/ktu-researchers-develop-a-model-that-improves-machine-understanding-of-the-real-world\/\" target=\"_blank\" rel=\"noopener\" title=\"\">Kaunas University of Technology<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A new model from Kaunas University of Technology helps computers move beyond simply \u201cseeing\u201d objects to understanding their meaning in real-world 3D scenes. 
The advance could make self-driving cars, drones and digital twins safer and more reliable.<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"single-no-separators","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[8],"tags":[607],"class_list":["post-35573","post","type-post","status-publish","format-standard","hentry","category-ai","tag-kaunas-university-of-technology"],"acf":[],"aioseo_notices":[],"uagb_featured_image_src":{"full":false,"thumbnail":false,"medium":false,"medium_large":false,"large":false,"1536x1536":false,"2048x2048":false},"uagb_author_info":{"display_name":"The University Network","author_link":"https:\/\/www.tun.com\/home\/author\/funky_junkie\/"},"uagb_comment_info":0,"uagb_excerpt":"A new model from Kaunas University of Technology helps computers move beyond simply \u201cseeing\u201d objects to understanding their meaning in real-world 3D scenes. 
The advance could make self-driving cars, drones and digital twins safer and more reliable.","_links":{"self":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/35573","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/comments?post=35573"}],"version-history":[{"count":10,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/35573\/revisions"}],"predecessor-version":[{"id":35590,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/35573\/revisions\/35590"}],"wp:attachment":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/media?parent=35573"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/categories?post=35573"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/tags?post=35573"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}