Spaces:
Sleeping
Sleeping
project_root/ | |
βββ app.py # The main Gradio endpoint that runs the training pipeline. | |
βββ requirements.txt # Lists the Python dependencies. | |
βββ source_files/ # Directory containing your input data files. | |
β βββ quranic-corpus-morphology-0.4.txt | |
β βββ en.sample.quran-maududi.txt | |
β βββ en.w4w.qurandev.txt | |
βββ working_directory/ # Directory for intermediate outputs. | |
βββ processed_data/ # Processed verse data (JSON and TXT files). | |
βββ checkpoints/ # Checkpoints saved during training. | |
βββ logs/ # (Optional) Additional log files. | |
βββ state/ # Pipeline state files (e.g., pipeline_state.json). |