Mr. Zushuang Liang | Salient Object Detection | Best Researcher Award

Mr. Zushuang Liang | Salient Object Detection | Best Researcher Award

Mr. Zushuang Liang, Harbin Institute of Technology, China

Mr. Zushuang Liang is a graduate student at the Harbin Institute of Technology, specializing in Computer Vision with a focus on Salient Object Detection and Graph Neural Networks (GNNs). His innovative research, including the development of a multi-scale graph attention network for video detection, holds promising applications in areas such as autonomous driving and surveillance. Additionally, Mr. Liang explores interdisciplinary work by integrating machine learning with music technology through piano polyphonic transcription, showcasing his versatility and contribution to both fields.

Professional Profile:

Orcid

Suitability for the Best Researcher Award:

Mr. Liangā€™s work is not only technically innovative but also highly impactful. His contributions to video salient object detection with applications that extend to fields like autonomous driving, surveillance, and multimedia retrieval make him a deserving candidate for the Best Researcher Award. His interdisciplinary approach, combining machine learning with music technology, further distinguishes him as a forward-thinking researcher.

Educational Background:

He earned his Bachelorā€™s Degree from the Harbin Institute of Technology and is currently pursuing a Masterā€™s degree at the same institution, within the School of Electronics and Information Engineering.

Area of Specialization:

Mr. Zushuang Liang specializes in Computer Vision with a focus on Salient Object Detection and Graph Neural Networks (GNNs). His work revolves around enhancing video detection accuracy by applying innovative techniques in multi-scale graph attention networks.

Research & Contributions:

His pioneering research includes developing the multi-scale graph attention network for video salient object detection, with potential applications in autonomous driving and surveillance. Additionally, he bridges disciplines by working on piano polyphonic transcription, integrating machine learning with music technology.

Publication Top Notes:

Title: DAFE-MSGAT: Dual-Attention Feature Extraction and Multi-Scale Graph Attention Network for Polyphonic Piano Transcription
  • Year: 2024

 

 

Dr. Jiangning Zhang | Computer Vision | Best Researcher Award

Dr. Jiangning Zhang | Computer Vision | Best Researcher Award

Dr. Jiangning Zhang, Zhejiang University, China

Dr. Jiangning Zhang, a Principal Researcher at YouTu Lab, Tencent in Shanghai, holds a Ph.D. in Control Science and Engineering and an M.D. from Zhejiang University, as well as a B.S. in Electronic Information from Wuhan University. He leads the Industry Perception and AIGC teams, focusing on neural architecture design, particularly transformer-based and lightweight vision models. Dr. Zhang’s research spans multi-modal AIGC, including image and video generation, human-centric editing, and virtual digital human technologies, and extends to 3D scene segmentation with foundation models for visual anomaly classification. His work drives advancements in cutting-edge technologies and applications in artificial intelligence and computer vision.

Professional Profile:

Google Scholar

Suitability for the Award

Dr. Jiangning Zhang is highly suitable for the Best Researcher Award for the following reasons:

  1. Innovative Research Contributions:
    • Dr. Zhangā€™s work in neural architecture design and multi-modal generative models has advanced the field of computer vision and AI. His research on transformer-based architectures and lightweight vision models contributes to the efficiency and effectiveness of AI systems.
    • His involvement in developing multi-modal GANs, VAEs, and diffusion models for various applications, including 2D/3D digital human generation and motion generation, demonstrates significant innovation and impact in the field.
  2. High-Impact Publications:
    • Dr. Zhang has published influential papers in top-tier conferences such as ICCV and TIP, showcasing his contributions to cutting-edge research. His work on efficient attention-based models and anomaly detection reflects his expertise and impact on the academic community.
  3. Leadership and Influence:
    • As a Principal Researcher at Tencentā€™s YouTu Lab, Dr. Zhang leads teams working on industry perception and advanced AI-generated content, indicating his role in shaping future technologies and influencing both academic and industrial research.
  4. Relevance and Application:
    • Dr. Zhangā€™s research addresses practical and emerging challenges in AI, such as image/video generation, digital human reconstruction, and scene segmentation. His work has practical applications in various industries, enhancing its relevance and impact.

Educational BackgroundĀ 

Dr. Zhang earned his Ph.D. in Control Science and Engineering from Zhejiang University, Hangzhou, China, under the guidance of Prof. Yong Liu. He pursued his M.D. at the same institution and obtained his B.S. in Electronic Information from Wuhan University.

Current Position

Dr. Jiangning Zhang is a Principal Researcher leading the Industry Perception and AIGC teams at YouTu Lab, Tencent, Shanghai.

Research InterestsĀ 

  • Neural Architecture Design: Specializing in transformer-based architectures and lightweight vision models.
  • Multi-modal AIGC Research: Investigating image and video generation, human-centric editing, and 2D/3D virtual digital human technologies, including reconstruction and animation.
  • 3D Scene Segmentation: Utilizing foundation models for visual anomaly classification and segmentation.

Publication Top Notes:

  • Title: Omni-Frequency Channel-Selection Representations for Unsupervised Anomaly Detection
    • Year: 2023
    • Cited by: 96
  • Title: Rethinking Mobile Block for Efficient Attention-Based Models
    • Year: 2023
    • Cited by: 78
  • Title: Towards Open Vocabulary Learning: A Survey
    • Year: 2024
    • Cited by: 71
  • Title: Multimodal Industrial Anomaly Detection via Hybrid Fusion
    • Year: 2023
    • Cited by: 64
  • Title: Region-Aware Face Swapping
    • Year: 2022
    • Cited by: 56

 

 

Mr. Xiaoyin Zheng | Computer Vision Awards | Best Researcher Award

Mr. Xiaoyin Zheng | Computer Vision Awards | Best Researcher Award

Mr. Xiaoyin Zheng, XMotors.ai, United States

Mr. Xiaoyin Zheng is a skilled Computer Vision Algorithm Engineer at XMotors.ai, where he focuses on integrating deep learning for advanced cabin monitoring and driver state analysis. With an M.S. in Engineering Technology from Purdue University and a Bachelor’s in Automotive Engineering from Wuhan University of Technology, Xiaoyin excels in Python, C, C++, and MATLAB/Simulink. His technical expertise encompasses computer vision and deep learning, particularly in object classification, detection, and tracking. Xiaoyinā€™s notable research includes autonomous vehicle simulators and lithium battery estimation, and he has achieved recognition with a first prize at the GM Tech Center competition. Additionally, his experience as a Graduate Teaching Assistant at Purdue University underscores his dedication to advancing engineering education and technology.

šŸŒ Professional Profile:
Google Scholar

šŸŽ“ Education:

Xiaoyin Zheng earned his M.S. in Engineering Technology from Purdue University, specializing in robotics and self-driving technology. He also holds a Bachelor’s in Automotive Engineering from Wuhan University of Technology.

šŸ”¬ Technical Skills:

Xiaoyin is proficient in Python, C, C++, and MATLAB/Simulink. His expertise includes computer vision and deep learning, with a focus on object classification, detection, segmentation, tracking, and model acceleration.

šŸ’¼ Professional Experience:

Xiaoyin is currently a Computer Vision Algorithm Engineer at XMotors.ai, where he integrates deep learning into cabin monitoring systems and improves driver state monitoring through advanced data processing. Previously, he interned as a Mechanical Engineer at Along Aircraft Manufacturing Company, where he worked on airplane mechanical parts and fly test approvals.

šŸ”¬ Research Experience:

His research includes developing simulators for autonomous vehicle dynamics and lithium battery state-of-charge estimation using extended Kalman filtering. He also contributed to designing a foldable personal mobility device, winning first prize at the GM Tech Center competition.

šŸ“š Teaching Experience:

As a Graduate Teaching Assistant at Purdue University, Xiaoyin taught Automated Manufacturing Processes and Applied Statics, guiding students in CNC operations, CAD design, and fundamental engineering concepts.

šŸ† Achievements:

Xiaoyin’s innovative work in robotics and autonomous systems, coupled with his successful research and teaching roles, highlights his commitment to advancing engineering technology and education.

Publication Top Notes:

Lithium battery soc estimation based on extended kalman filtering algorithm
  • Year: 2018
  • Citations: 27
Multi-scale fractal characteristics of the pore system in low-permeability conglomerates from the junggar basin
  • Year: 2023
  • Citations: 2
Anything in Any Scene: Photorealistic Video Object Insertion
  • Year: 2024
A Minimal Set of Parameters Based Depth-Dependent Distortion Model and Its Calibration Method for Stereo Vision Systems
  • Year: 2024

 

 

 

Mr. Saket Kumar Singh | Computer Vision | Best Researcher Award

Mr. Saket Kumar Singh | Computer Vision | Best Researcher Award

Mr. Saket Kumar Singh, Birla Institute of Technology, Mesra, Ranchi, India

Saket Kumar Singh, born on October 21, 1993, is a Project Research Scientist currently working on an ICMR-funded project titled ā€œExplainable AI for Hypoxic Ischemic Encephalopathy Detection using ultrasound images in Jharkhand Neonates: A Deep Learning Approach.ā€ He is pursuing a Ph.D. in Computer Science and Engineering at BIT MESRA, Ranchi, focusing on implementing deep learning methods for the early diagnosis of chronic diseases. Singh has over seven years of experience as an Assistant Professor, teaching AI, Machine Learning, and Data Science at various esteemed institutions in India, including the National Institute of Advanced Manufacturing Technology, AKG Engineering College, IMS Engineering College, Kanpur Institute of Technology, and Subhash Institute of Software Technology. He holds an MTech in Computer Science from BIT MESRA, where he graduated with distinction, and a B.E. in Computer Science & Engineering from NRI Institute of Information Science & Tech., Bhopal. Saket is recognized for his technical, experimental, and interpersonal skills, making significant contributions to curriculum development, student mentoring, and organizing hackathons. He has a strong passion for technology and research, with numerous publications and project presentations to his name. Additionally, Saket has received several accolades in cultural and sports activities, showcasing his diverse talents and dedication to excellence.

šŸŒ Professional Profile:

 

ORCID

 

Academic Background šŸŽ“šŸ“–

  • Ph.D. (Pursuing) in Computer Science and Engineering, BIT MESRA, Ranchi (2021 ā€“ Present)
    • Thesis Title: Implementation of Deep Learning Methods for early diagnosis of various chronic diseases
  • MTech in Computer Science, BIT MESRA, Ranchi (2013-2015)
    • Percentage: 80.8% (First Class with Distinction)
  • B.E. in Computer Science & Engineering, NRI Institute of Information Science & Technology, Bhopal (2009-2013)
    • Percentage: 67.25% (First Class)
  • Schooling at D.A.V. Public School, Bariatu, Ranchi
    • Class XII (2009): 71% (First Class)
    • Class X (2007): 84% (First Class)

Employment History šŸ“…šŸ«

  • Assistant Professor, National Institute of Advanced Manufacturing Technology, Hatia, Ranchi (April 11, 2022 ā€“ April 10, 2023)
  • Assistant Professor, AKG Engineering College, Ghaziabad, Delhi NCR (July 9, 2018 ā€“ Dec 10, 2021)
  • Assistant Professor, IMS Engineering College, Ghaziabad, U.P. (July 19, 2017 ā€“ June 30, 2018)
  • Assistant Professor, Kanpur Institute of Technology, Kanpur, U.P. (January 18, 2016 ā€“ July 08, 2017)
  • Assistant Professor, Subhash Institute of Software Technology, Kanpur, U.P. (June 15, 2015 ā€“ January 17, 2016)

Roles & Responsibilities šŸ“‹

  • Collegewide Administrative Duties:
    • Syllabus Development for UG Course “Computer Engineering” and PG Course “Data Science”
    • Central ERP Coordinator at AKGEC
    • Initiated and Managed YouTube News Channel for AKGEC
    • Conducted Inter-College Hackathon and Coding Contests
  • Departmental & General Duties:
    • Delivering lectures and supervising labs
    • Departmental Student Attendance Management
    • Recruitment and mentoring students for competitions and projects
  • Academic Activities:
    • Delivered courses on AI, ML, and DL
    • Provided training under DST sponsored projects
    • Conducted hackathons and coding contests
    • Presented papers at national conferences

Courses Taught šŸ“˜šŸ‘Øā€šŸ«

  • BTech Classes: Operating Systems, Theory of Automata and Formal Languages, Machine Learning, Computer Networks, Artificial Intelligence, Database Management System

Publication Top Notes:

Convergence of various computer-aided systems for breast tumor diagnosis: a comparative insight

DNA sequence based data classification technique