NEW5,000 candles per pair (208 days / 7 months) — 10x more training data than min-2. Fetched in batches of 1,000 to work within Binance API limits.
NEWLSTM internal validation — 80/20 train/val split with early stopping (patience=3). Weights only saved when validation loss improves.
NEWZero-trade validation detection — 3 consecutive zero-trade validations automatically resets the RL policy.
FIXScaler data leakage fixed — MinMaxScaler now fitted on training split only. Validation and test data no longer leak future price ranges into training.
FIXLSTM padding fixed — zero padding instead of edge replication. Early episodes no longer see fabricated repeated prices.
FIXBinance rate limit infinite retry fixed — now capped at 3 attempts with clear error on failure.
FIXKnowledge query redundancy fixed — queried once per step instead of 2-3 times. Significant training speedup.
FIXStale features after LSTM retrain fixed — all symbols now get fresh candles and features after every retrain cycle.
IMPROVEDGamma reduced from 0.99 to 0.95 — agent now focuses on shorter-term rewards, better suited for crypto volatility.
IMPROVEDOvertrading timestamps now only count actual trade executions, not attempted no-op actions.
IMPROVEDAPI startup now fails loudly if LSTM or scaler initialization fails, instead of silently serving broken responses.