Kaushik066 committed
Commit b9bd11b · verified · 1 Parent(s): e965ca9

Update app.py

Files changed (1)
  1. app.py +46 -17
app.py CHANGED
@@ -242,26 +242,55 @@ about_tab, app_tab = st.tabs(["About the app", "Face Recognition"])
 with about_tab:
 st.markdown(
 """
- ## Product Description/Objective
- An AI face recognition app for automated employee attendance uses advanced facial recognition technology to accurately and efficiently track employee attendance.
- By simply scanning employees' faces upon arrival and departure, the app eliminates the need for traditional timecards or biometric devices, reducing errors and fraud.
- It provides real-time attendance data, enhances workplace security, and streamlines HR processes for greater productivity and accuracy.

- ## How does it work?
- Our app leverages Google's advanced **Vision Transformer (ViT)** architecture, trained on the **LFW (Labeled Faces in the Wild) dataset**, to deliver highly accurate employee attendance tracking through facial recognition.
- The AI model intelligently extracts distinct facial features and compares them to the stored data of registered employees. When an employee’s face is scanned, the model analyzes the key features and generates a confidence score.
- A high score indicates a match, confirming the employee’s identity and marking their attendance automatically. This seamless, secure process ensures precise tracking while minimizing errors and enhancing workplace efficiency.

- ### About the architecture
- The Vision Transformer (ViT) is a deep learning architecture designed for image classification tasks. It applies transformer models, originally developed for natural language processing (NLP), to images.
- ViT divides an image into fixed-size non-overlapping patches. Each patch is flattened into a 1D vector, which is then linearly embedded into a higher-dimensional space. The patch embeddings are processed by a standard transformer encoder.
- This consists of layers with multi-head self-attention and feed-forward networks. The transformer is capable of learning global dependencies across the entire image.
- The Vision Transformer outperforms traditional convolutional neural networks (CNNs) on large-scale datasets, especially when provided with sufficient training data and computational resources.

- ### About the dataset
- Labeled Faces in the Wild (LFW) is a well-known dataset used primarily for evaluating face recognition algorithms. It consists of facial images of famous individuals collected from the web.
- LFW contains 13,000+ labeled images of 5,749 different individuals. The images often show individuals in different lighting, poses, and backgrounds.
- LFW is typically used for face verification and face recognition tasks: the goal is to determine whether two images represent the same person.
 """)

 # Gesture recognition Tab
 
 with about_tab:
 st.markdown(
 """
+ # 👁️‍🗨️ AI-Powered Face Recognition Attendance System
+ Effortless, Secure, and Accurate Attendance with Vision Transformer Technology

+ An intelligent, facial recognition-based attendance solution that redefines how organizations manage employee presence. By leveraging cutting-edge computer vision and AI, the app automates attendance tracking with speed, precision, and reliability—no timecards, no fingerprint scans, just a glance.
+ ## 🎯 Project Objective
+ To replace outdated, manual attendance methods with a seamless, contactless facial recognition system. Our solution not only improves the accuracy of attendance logs but also boosts workplace security and streamlines HR operations, all in real time.
+ Employees are simply scanned as they enter or leave the premises. Their attendance is automatically logged, reducing the risk of buddy punching, manual entry errors, and delays in record-keeping.
 
 
+ ## 🧠 How It Works: The AI in Action
+ At the core of this app is Google’s Vision Transformer (ViT) architecture, trained on the Labeled Faces in the Wild (LFW) dataset for robust, real-world face recognition.
+
+ - **Face Detection & Feature Extraction**
+ The model scans an employee’s face and extracts a high-dimensional representation of their unique features.
+
+ - **Identity Matching with Confidence Scoring**
+ The scanned features are compared to stored profiles. If the confidence score crosses a threshold, the model confirms the match and automatically marks attendance.
+
+ - **Real-Time Logging**
+ The app logs entry and exit times in real time, providing live dashboards and attendance reports for HR and management.
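The matching step above can be sketched as a cosine-similarity comparison against stored embeddings. This is a minimal illustration, not the app's actual code: the employee names, the toy 4-dimensional vectors, and the 0.85 threshold are all assumptions for the sake of the example (real ViT face embeddings are far higher-dimensional).

```python
import numpy as np

def match_identity(scanned, profiles, threshold=0.85):
    """Compare a scanned face embedding to stored employee profiles.

    Returns (employee_name, confidence) when the best cosine similarity
    crosses the threshold, otherwise (None, confidence).
    """
    best_name, best_score = None, -1.0
    for name, stored in profiles.items():
        # Cosine similarity between the two embedding vectors.
        score = np.dot(scanned, stored) / (
            np.linalg.norm(scanned) * np.linalg.norm(stored)
        )
        if score > best_score:
            best_name, best_score = name, score
    if best_score >= threshold:
        return best_name, best_score
    return None, best_score

# Toy embeddings standing in for stored employee profiles.
profiles = {
    "alice": np.array([0.9, 0.1, 0.0, 0.4]),
    "bob":   np.array([0.1, 0.8, 0.5, 0.0]),
}
scan = np.array([0.88, 0.12, 0.02, 0.41])  # very close to "alice"
name, confidence = match_identity(scan, profiles)
```

A scan that matches no stored profile stays below the threshold, so the function returns `None` and attendance is not marked.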
+ ## 🏗️ About the Architecture: Vision Transformer (ViT)
+ The Vision Transformer (ViT) brings the power of transformer models—originally created for language—to the world of images. Here's how it works:
+
+ - An input image is split into fixed-size non-overlapping patches.
+ - Each patch is flattened and embedded into a higher-dimensional space.
+ - These embeddings are fed into a transformer encoder, which learns complex spatial and contextual relationships across the entire image using multi-head self-attention.
+ - ViT’s ability to capture global dependencies enables it to outperform traditional CNNs when trained on sufficient data.
+
+ This makes it ideal for high-accuracy face recognition in dynamic, real-world environments.
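The patch-splitting and embedding steps can be sketched with plain NumPy. The 224×224 input, 16×16 patches, and 768-dimensional embedding below follow common ViT-Base defaults, but the random projection matrix is only a stand-in for the learned patch-embedding weights:

```python
import numpy as np

rng = np.random.default_rng(0)

# A dummy 224x224 RGB image (a common ViT input size).
image = rng.random((224, 224, 3))

patch = 16             # patch side length
n = 224 // patch       # 14 patches per side -> 196 patches total
embed_dim = 768        # ViT-Base embedding dimension

# 1) Split the image into non-overlapping 16x16 patches, flattening
#    each into a 1D vector of length 16 * 16 * 3 = 768.
patches = (
    image.reshape(n, patch, n, patch, 3)
    .swapaxes(1, 2)
    .reshape(n * n, -1)
)

# 2) Linearly embed each flattened patch; a random matrix stands in
#    for the projection that ViT learns during training.
W = rng.random((patches.shape[1], embed_dim))
embeddings = patches @ W  # shape: (196, 768)
```

The resulting sequence of 196 patch embeddings is what the transformer encoder consumes, treating patches the way a language model treats tokens.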
+ ## 📚 About the Dataset: Labeled Faces in the Wild (LFW)
+ To train the model, we used the renowned Labeled Faces in the Wild (LFW) dataset:
+
+ - 13,000+ facial images of 5,749 individuals, shown in diverse lighting, angles, and backgrounds
+ - Sourced from real-world photographs of public figures
+ - A benchmark dataset for tasks like face verification and recognition
+
+ The diversity in LFW ensures our model is resilient to variations in appearance, making it highly reliable in real-world workplace scenarios.
+ ## ✅ Key Features
+ - Fast, contactless attendance logging
+ - High-security identity verification
+ - Real-time data and analytics
+ - Powered by state-of-the-art Vision Transformer architecture
+ - Eliminates manual records, reduces fraud, enhances efficiency
+ ## 👥 Use Cases
+ - Corporate Offices: Accurate time tracking and security for large workforces
+ - Factories & Warehouses: Contactless attendance in high-throughput environments
+ - Educational Institutions: Seamless student and staff attendance
+ - Healthcare & Public Services: Ensures hygienic, automated check-ins
+ ## 🚀 Future Scope
+ Looking ahead, we aim to integrate multi-face detection for group scanning, mask-aware recognition, and cross-location synchronization for distributed teams—all while preserving data privacy and security.
 """)

 # Gesture recognition Tab