DESIGN AND DEVELOPMENT OF PREDICTIVE MODEL FOR ACTIVITY RECOGNITION USING MACHINE LEARNING

SINGH, ROSHNI; Sharma, Abhilasha (SUPERVISOR)

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More

Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/22750

Full metadata record

DC Field	Value	Language
dc.contributor.author	SINGH, ROSHNI	-
dc.contributor.author	Sharma, Abhilasha (SUPERVISOR)	-
dc.date.accessioned	2026-06-08T05:40:18Z	-
dc.date.available	2026-06-08T05:40:18Z	-
dc.date.issued	2025-12	-
dc.identifier.uri	http://dspace.dtu.ac.in:8080/jspui/handle/repository/22750	-
dc.description.abstract	This thesis aims to develop robust and adaptive methods for activity recognition in challenging environments involving occlusions, low visibility, complex motion patterns, and dynamic backgrounds. While conventional methods have shown promise using handcrafted features or template-based models, their performance significantly degrades under real-world conditions due to limitations in generalization, sensitivity to noise, and dependency on clean, well-labeled datasets. To address these issues, the work explores multiple directions, including skeleton-based recognition, spatialtemporal modeling, attention mechanisms, low-light enhancement, and multimodal fusion. The proposed methods are designed to enhance both the accuracy and robustness of recognition systems in real-world settings. Initially, the thesis outlines a systematic literature review that analyzes 88 key publications from 2014 to 2024, selected from over 8,664 research papers. This review categorizes state-of-the-art HAR techniques based on their architectures, datasets, evaluation strategies, and challenges, highlighting the research gaps in handling real-time, noisy, and occluded scenarios. Based on these insights, a set of machine learning and deep learning frameworks, models and algorithms are proposed. The second work introduced a ConvST-LSTM-Net for skeleton-based activity recognition. This model identifies and processes only the most informative skeletal keyjoints in each frame, leveraging convolutional and spatiotemporal LSTM layers for effective long-term sequence modeling. To capture subtle spatial-temporal variations in video clips, a spatial-temporal attention-based, i.e., STAD-ConvBi-LSTM model is developed in the third work. This architecture integrates a dual attention mechanism with convolutional and bi-directional LSTM networks to extract discriminative humancentric features. The method demonstrates exceptional performance across various datasets and a custom synthetic dataset, achieving recognition accuracies exceeding 96%. For recognizing the challenge of occlusion in skeleton-based data, a MultiStream Part-Aware Spatial-Temporal Graph Convolutional Network as MSPAST-GCN is proposed. This model uses a part-aware inhibition strategy and a graph convolutionbased architecture to effectively model keyjoint relationships, even in the presence of missing or noisy data. It outperforms prior methods with a 6% accuracy gain on occlusion-affected datasets. For video-based activity classification, a hybrid model named MV-DBiLSTM is presented, which combines MobileNetV2 for spatial feature extraction with a Deep Bi-LSTM network for learning temporal dependencies. This framework balances computational efficiency and deep temporal reasoning, making it suitable for deployment in smart systems. In visually challenging conditions like lowlight environments, where traditional recognition systems face challenges. This thesis proposes a low-light enhancement pipeline integrated with HAR models. A combination of local enhancement modules and transformer-based global adjustment is used to improve visibility without distorting critical features. This significantly improves activity detection in surveillance scenarios under poor lighting. All proposed models are rigorously validated across benchmark and synthetic datasets using both quantitative and qualitative assessments. The analysis demonstrates that all the presented methods outperform contemporary approaches in terms of recognition accuracy, temporal consistency, and adaptability to diverse real-world conditions. Overall, this thesis contributes multiple novel activity recognition architectures tailored for different challenges: occlusion, temporal complexity, lighting conditions, and data constraints. These contributions enable the development of more smart, intelligent, reliable, and context-aware recognition systems, with impactful applications in surveillance, healthcare, smart homes, and assistive technologies.	en_US
dc.language.iso	en	en_US
dc.relation.ispartofseries	TD-8656;	-
dc.subject	PREDICTIVE MODEL	en_US
dc.subject	MACHINE LEARNING	en_US
dc.subject	ACTIVITY RECOGNITION	en_US
dc.subject	LSTM MODEL	en_US
dc.title	DESIGN AND DEVELOPMENT OF PREDICTIVE MODEL FOR ACTIVITY RECOGNITION USING MACHINE LEARNING	en_US
dc.type	Thesis	en_US
Appears in Collections:	Ph.D. Computer Engineering

Files in This Item:

File	Description	Size	Format
ROSHNI SINGH Ph.D..pdf		16.3 MB	Adobe PDF	View/Open
ROSHNI SINGH Plag..pdf		26.72 MB	Adobe PDF	View/Open

Show simple item record