{"id":13327,"date":"2024-12-27T18:22:25","date_gmt":"2024-12-27T18:22:25","guid":{"rendered":"https:\/\/www.tun.com\/home\/?p=13327"},"modified":"2024-12-27T18:22:26","modified_gmt":"2024-12-27T18:22:26","slug":"seoultech-researchers-develop-pv2doc-to-convert-presentation-videos-into-summarized-documents","status":"publish","type":"post","link":"https:\/\/www.tun.com\/home\/seoultech-researchers-develop-pv2doc-to-convert-presentation-videos-into-summarized-documents\/","title":{"rendered":"SeoulTech Researchers Develop PV2DOC to Convert Presentation Videos Into Summarized Documents"},"content":{"rendered":"\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-uagb-blockquote uagb-block-e7eb3fc3 uagb-blockquote__skin-border uagb-blockquote__stack-img-none\"><blockquote class=\"uagb-blockquote\"><div class=\"uagb-blockquote__content\">Researchers at Seoul National University of Science and Technology have developed PV2DOC, a revolutionary tool to transform presentation videos into summarized, structured documents, enhancing accessibility and efficiency.<\/div><footer><div class=\"uagb-blockquote__author-wrap uagb-blockquote__author-at-left\"><\/div><\/footer><\/blockquote><\/div>\n\n\n\n<div class=\"wp-block-group is-content-justification-space-between is-nowrap is-layout-flex wp-container-core-group-is-layout-0dfbf163 wp-block-group-is-layout-flex\"><div style=\"font-size:16px;\" class=\"has-text-align-left wp-block-post-author\"><div class=\"wp-block-post-author__content\"><p class=\"wp-block-post-author__name\">The University Network<\/p><\/div><\/div>\n\n\n<div class=\"wp-block-uagb-social-share uagb-social-share__outer-wrap uagb-social-share__layout-horizontal uagb-block-ee584a31\">\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-ec619ce7\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/www.facebook.com\/sharer.php?u=\" tabindex=\"0\" role=\"button\" aria-label=\"facebook\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\"><path d=\"M504 256C504 119 393 8 256 8S8 119 8 256c0 123.8 90.69 226.4 209.3 245V327.7h-63V256h63v-54.64c0-62.15 37-96.48 93.67-96.48 27.14 0 55.52 4.84 55.52 4.84v61h-31.28c-30.8 0-40.41 19.12-40.41 38.73V256h68.78l-11 71.69h-57.78V501C413.3 482.4 504 379.8 504 256z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n\n\n\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-32d99934\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/twitter.com\/share?url=\" tabindex=\"0\" role=\"button\" aria-label=\"twitter\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\"><path d=\"M389.2 48h70.6L305.6 224.2 487 464H345L233.7 318.6 106.5 464H35.8L200.7 275.5 26.8 48H172.4L272.9 180.9 389.2 48zM364.4 421.8h39.1L151.1 88h-42L364.4 421.8z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n\n\n\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-1d136f14\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/www.linkedin.com\/shareArticle?url=\" tabindex=\"0\" role=\"button\" aria-label=\"linkedin\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 448 512\"><path d=\"M416 32H31.9C14.3 32 0 46.5 0 64.3v383.4C0 465.5 14.3 480 31.9 480H416c17.6 0 32-14.5 32-32.3V64.3c0-17.8-14.4-32.3-32-32.3zM135.4 416H69V202.2h66.5V416zm-33.2-243c-21.3 0-38.5-17.3-38.5-38.5S80.9 96 102.2 96c21.2 0 38.5 17.3 38.5 38.5 0 21.3-17.2 38.5-38.5 38.5zm282.1 243h-66.4V312c0-24.8-.5-56.7-34.5-56.7-34.6 0-39.9 27-39.9 54.9V416h-66.4V202.2h63.7v29.2h.9c8.9-16.8 30.6-34.5 62.9-34.5 67.2 0 79.7 44.3 79.7 101.9V416z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n<\/div>\n<\/div>\n<\/div><\/div>\n\n\n\n<p>Researchers at Seoul National University of Science and Technology led by Hyuk-Yoon Kwon, an associate professor in the Department of Industrial &amp; Information Systems Engineering, have announced a pioneering tool that could revolutionize how we consume and manage presentation-style video content. Named PV2DOC, this innovative software converts lengthy presentation videos into concise, structured documents, enabling users to access and comprehend critical information more efficiently.<\/p>\n\n\n\n<p>Presentation videos combining slides, graphics and spoken explanations have surged in popularity, especially during the COVID-19 pandemic. While engaging, these videos are often cumbersome, requiring viewers to sit through entire recordings to glean specific details and occupying significant storage space.<\/p>\n\n\n\n<p>PV2DOC addresses these pain points by transforming video data into organized PDFs, effectively consolidating both audio and visual elements. Unlike existing summarizers that need a transcript and become ineffective without one, PV2DOC excels by extracting and merging data directly from the video itself.<\/p>\n\n\n\n<p>\u201cFor users who need to watch and study numerous videos, such as lectures or conference presentations, PV2DOC generates summarized reports that can be read within two minutes,\u201d Kwon said in a <a href=\"https:\/\/www.eurekalert.org\/news-releases\/1069199\" target=\"_blank\" rel=\"noopener\" title=\"\">news release<\/a>. \u201cAdditionally, PV2DOC manages figures and tables separately, connecting them to the summarized content so users can refer to them when needed.\u201d<\/p>\n\n\n\n<p>PV2DOC operates through a multi-step process involving advanced image and audio processing techniques. The tool captures video frames at one-second intervals, identifying unique visuals using the structural similarity index. It then applies object detection models \u2014 Mask R-CNN and YOLOv5 \u2014 to recognize figures, tables and other key elements. Any fragmented images are combined using a figure merge technique.<\/p>\n\n\n\n<p>For text extraction, the software leverages Google\u2019s Tesseract engine for optical character recognition (OCR), organizing the extracted text into structured formats with headings and paragraphs. Simultaneously, audio content is transcribed using the Whisper model, an open-source speech-to-text tool. The transcribed text is then summarized using the TextRank algorithm.<\/p>\n\n\n\n<p>The result is a Markdown document convertible into a PDF, presenting the video&#8217;s information in a clear, accessible manner that aligns with the video\u2019s original structure.<\/p>\n\n\n\n<p>\u201cThis software simplifies data storage and facilitates data analysis for presentation videos by transforming unstructured data into a structured format, thus offering significant potential from the perspectives of information accessibility and data management,\u201d Kwon added. \u201cIt provides a foundation for more efficient utilization of presentation videos.\u201d<\/p>\n\n\n\n<p>Looking ahead, the research team plans to enhance PV2DOC further by training a large language model, akin to ChatGPT. The goal is to offer a question-answering service, allowing users to interact with the video content more dynamically and obtain accurate, contextually-relevant responses to their queries.<\/p>\n\n\n\n<p>The development of PV2DOC marks a significant step forward in information technology, promising to streamline the consumption and storage of presentation videos. Its capacity to transform vast amounts of unstructured video data into manageable, searchable documents could have extensive applications in educational, corporate and research settings worldwide.<\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Researchers at Seoul National University of Science and Technology led by Hyuk-Yoon Kwon, an associate professor in the Department of Industrial &amp; Information Systems Engineering, have announced a pioneering tool that could revolutionize how we consume and manage presentation-style video content. Named PV2DOC, this innovative software converts lengthy presentation videos into concise, structured documents, enabling [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"single-no-separators","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[17],"tags":[],"class_list":["post-13327","post","type-post","status-publish","format-standard","hentry","category-tech"],"acf":[],"aioseo_notices":[],"uagb_featured_image_src":{"full":false,"thumbnail":false,"medium":false,"medium_large":false,"large":false,"1536x1536":false,"2048x2048":false},"uagb_author_info":{"display_name":"The University Network","author_link":"https:\/\/www.tun.com\/home\/author\/funky_junkie\/"},"uagb_comment_info":0,"uagb_excerpt":"Researchers at Seoul National University of Science and Technology led by Hyuk-Yoon Kwon, an associate professor in the Department of Industrial &amp; Information Systems Engineering, have announced a pioneering tool that could revolutionize how we consume and manage presentation-style video content. Named PV2DOC, this innovative software converts lengthy presentation videos into concise, structured documents, enabling&hellip;","_links":{"self":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/13327","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/comments?post=13327"}],"version-history":[{"count":8,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/13327\/revisions"}],"predecessor-version":[{"id":13340,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/13327\/revisions\/13340"}],"wp:attachment":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/media?parent=13327"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/categories?post=13327"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/tags?post=13327"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}