latchkeyChild
/

deepseek-trading-assistant

Model card Files Files and versions Community

DeepSeek Trading Assistant

This is a fine-tuned version of DeepSeek-R1-Distill-Qwen-32B specialized for generating trading strategies and market analysis.

Model Details

Model Description

Developed by: latchkeyChild
Model type: Decoder-only language model
Language(s): English
License: MIT
Finetuned from model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Uses

Direct Use

This model is designed to:

Analyze market conditions using technical indicators
Generate trading strategies based on market analysis
Implement risk management rules
Create Python code for strategy implementation

Training Data

The model is trained on a custom dataset containing:

Market analysis using technical indicators (RSI, MACD, Moving Averages)
Trading strategy implementations
Risk management rules
Python code examples using QuantConnect framework

Training Procedure

Training Hyperparameters

Number of epochs: 3
Batch size: 2
Learning rate: 1e-5
Gradient accumulation steps: 8
Warmup steps: 100
Training regime: fp16 mixed precision with gradient checkpointing
Temperature: 0.6 (recommended for DeepSeek-R1 series)

Technical Specifications

Compute Infrastructure

Required Hardware: 2x NVIDIA A10G GPUs or 1x A100 GPU
Training Time (estimated): 2-4 hours

Model Card Contact

For questions or issues, please open an issue in the repository.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support