Spaces:
Running
Running
Description
Browse files
app.py
CHANGED
@@ -10,8 +10,8 @@ warnings.filterwarnings('ignore')
|
|
10 |
|
11 |
# Define problem statement
|
12 |
problem_statement = """
|
13 |
-
###
|
14 |
-
This project aims to
|
15 |
"""
|
16 |
|
17 |
# Define solution overview
|
|
|
10 |
|
11 |
# Define problem statement
|
12 |
problem_statement = """
|
13 |
+
### Problem Statement
|
14 |
+
This project aims to develop a deep learning model to verbally describe image contents for the visually impaired using caption generation with an attention mechanism on the Flickr8K dataset. Inspired by the "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention" paper, the model utilizes a CNN-RNN architecture to extract image features and generate captions, facilitating accessibility. The Kaggle dataset comprises 8,000 images, each paired with five descriptive captions, enabling comprehensive understanding of image content.
|
15 |
"""
|
16 |
|
17 |
# Define solution overview
|