metadata
title: Event Data Extraction
emoji: 🌐
colorFrom: pink
colorTo: blue
sdk: streamlit
sdk_version: 1.42.2
app_file: app.py
pinned: false
python_version: 3.10.0

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Event Data Extraction

A testing and demo application for extracting event data from websites.

Repository overview

/pages/
│   └── Streamlit pages for the UI
│
/src/
├── configuration/
│   └── Streamlit-specific configuration files
│
├── crawler/
│   └── Scripts for crawling and collecting event data from websites
│
├── persistence/
│   └── Database connections and query logic
│
├── utils/
│   └── Helper functions and preprocessing utilities
│
├── nlp/
│   ├── experimental/
│   │   └── Various NLP tools and technologies under evaluation
│   │
│   └── playground/
│       └── NLP scripts used within the Streamlit app (Pages: Playground, Pipeline, Testing)

Run locally

Python Version: 3.10

  1. Install the requirements from the requirements.txt file

  2. Create a Hugging Face access token on the Hugging Face platform

  3. Request the values for any environment variables you are missing

  4. Create a .env file in the root directory with the following environment variables (⚠️ Do NOT commit this file!)

# MongoDB
MONGO_HOST=...
MONGO_USERNAME=...
MONGO_PASSWORD=...

# Google Maps API
GOOGLE_MAPS_API_KEY=...

# OpenAI API
OPENAI_API_KEY=...

# Hugging Face Inference API
INFERENCE_API_TOKEN=...

# Hugging Face Spaces (access token)
HUGGING_FACE_SPACES_TOKEN=...

# Google Cloud Platform API
GOOGLE_API_KEY=...

  5. Start the Streamlit app, which opens in the browser
  streamlit run app.py
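
Before starting the app, it can help to confirm that the .env file from step 4 is complete. The following is a minimal sketch and not part of the repository: the script and its function names (`parse_env_file`, `missing_vars`) are hypothetical, it uses only the standard library, and it assumes the .env file contains simple KEY=VALUE lines as in the template above. How the app itself loads these variables is not shown in this README.

```python
import os
from pathlib import Path

# Variable names copied from the .env template in this README.
REQUIRED_VARS = [
    "MONGO_HOST",
    "MONGO_USERNAME",
    "MONGO_PASSWORD",
    "GOOGLE_MAPS_API_KEY",
    "OPENAI_API_KEY",
    "INFERENCE_API_TOKEN",
    "HUGGING_FACE_SPACES_TOKEN",
    "GOOGLE_API_KEY",
]


def parse_env_file(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments.

    Hypothetical helper: returns an empty dict if the file does not exist.
    """
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_vars(env):
    """Return required names that are absent, empty, or still the '...' placeholder."""
    return [name for name in REQUIRED_VARS if env.get(name, "") in ("", "...")]


if __name__ == "__main__":
    # Variables already set in the process environment take precedence over .env.
    merged = {**parse_env_file(".env"), **os.environ}
    missing = missing_vars(merged)
    if missing:
        print("Missing or placeholder values: " + ", ".join(missing))
    else:
        print("All required variables are set.")
```

Saved under any name in the root directory (for example a hypothetical check_env.py), the script can be run with python before `streamlit run app.py`. It only reports variable names and never prints their values.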