Commit 4e40bc0 by krishnapal2308 (verified)
1 Parent(s): b837217

Files changed (1)
  1. app.py +2 -2
app.py CHANGED
@@ -10,8 +10,8 @@ warnings.filterwarnings('ignore')
 
 # Define problem statement
 problem_statement = """
-### Overview
-This project aims to generate descriptive spoken captions for images, leveraging CNNs and RNNs for feature extraction and sequence generation, respectively. The model is trained on the Flickr8K dataset and extended with an attention mechanism for enhanced accessibility.
+### Problem Statement
+This project aims to develop a deep learning model to verbally describe image contents for the visually impaired using caption generation with an attention mechanism on the Flickr8K dataset. Inspired by the "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention" paper, the model utilizes a CNN-RNN architecture to extract image features and generate captions, facilitating accessibility. The Kaggle dataset comprises 8,000 images, each paired with five descriptive captions, enabling comprehensive understanding of image content.
 """
 
 # Define solution overview