Commit History

Refactor train.py to utilize a comprehensive configuration structure from config.yaml, enhancing model loading, dataset handling, and trainer setup. This update centralizes parameters for model, PEFT, dataset, and training settings, improving maintainability and flexibility.
611c848
unverified

mjschock commited on

Add hydra integration and configuration support in train.py, allowing dynamic model loading and training control. Update requirements.txt to include hydra-core dependency and introduce config.yaml for model parameters and training settings.
5bfd071
unverified

mjschock commited on

Refactor model loading in train.py to use a default model name parameter, enhancing flexibility. Adjust configuration for max sequence length and dtype for improved clarity and consistency.
aecd650
unverified

mjschock commited on

Refactor imports in train.py to improve organization and clarity, adding DataCollatorForLanguageModeling for enhanced data handling during training.
aae861a
unverified

mjschock commited on

Add DataCollatorForLanguageModeling to trainer configuration in train.py for improved data handling during training.
8ca2c5b
unverified

mjschock commited on

Update requirements.txt to specify unsloth version 2025.4.5 and refactor imports in train.py for improved organization and clarity.
04d059b
unverified

mjschock commited on

Remove unused dataset_text_field parameter from create_trainer function in train.py to streamline trainer configuration.
8bd5794
unverified

mjschock commited on

Update trainer configuration in train.py to align evaluation strategy with save strategy. Set eval_steps to match save_steps for consistent evaluation frequency.
d577d8c
unverified

mjschock commited on

Refactor trainer configuration in train.py for improved clarity. Clean up comments and ensure consistent formatting in evaluation strategy and model selection parameters.
aa6b654
unverified

mjschock commited on

Refactor train.py to improve code readability and organization. Adjust logging setup for clarity, streamline dependency installation commands, and enhance dataset splitting and formatting processes. Ensure consistent formatting in log messages and code structure.
9a87cb8
unverified

mjschock commited on

Enhance training script for SmolLM2-135M model by adding logging functionality, improving error handling, and implementing dataset validation split. Refactor model loading and dataset preparation processes for better clarity and robustness. Update trainer configuration to include evaluation strategy and logging of final metrics.
70eb9de
unverified

mjschock commited on

Add training script for SmolLM2-135M model using Unsloth. Includes model loading, dataset preparation, and training configuration. Provides detailed instructions for setup and execution.
7749830
unverified

mjschock commited on