A hybrid neural recommender system trained on the Amazon Musical Instruments dataset, fusing collaborative-filtering ID embeddings with multi-modal item attributes — product text (TF-IDF LSA), price, category, and brand.
Evaluated on held-out test set (20% split) · Amazon Musical Instruments · 99.2% sparsity · 30-epoch Adam + BPR (CosineAnnealingLR)
Symmetrical Hybrid Engine — two mirrored towers each fusing ID embeddings with attribute MLPs
SGD plateau diagnosis → Adam + BPR + CosineAnnealingLR drives loss from 0.717 → 0.115 over 30 epochs
Three variants trained identically (Adam + BPR, 20 epochs) — isolates contribution of each signal
| Variant | ID Embed | Attr Enc | HR@10 | NDCG@10 | Params |
|---|---|---|---|---|---|
| ID-Only | ✅ | ❌ | 0.0100 | 0.0063 | 2,781,697 |
| Attr-Only | ❌ | ✅ | 0.0000 | 0.0000 | 28,353 |
| Full Hybrid ★ | ✅ | ✅ | 0.0367 | 0.0155 | 2,799,105 |
Key finding: Full Hybrid (30 epochs, CosineAnnealingLR) achieves HR@10=0.0367 and NDCG@10=0.0155 — significantly outperforming all ablation variants. Attr-Only scores zero, confirming collaborative filtering (ID embeddings) is essential. ID-Only runs for only 20 epochs in the ablation vs 30 for the full model; the full model's longer training and combined signals explain the gap. The 2.8M item embeddings cover the full Amazon catalog (84,901 items), enabling cold-start recommendations even for items with no interactions.
73-dimensional item vectors combining structured metadata with text-derived semantic signals
Multi-modal product attributes extracted from metadata
TF-IDF: 1,000-vocab bigrams on product titles → TruncatedSVD(32) — 27.6% variance explained
Interaction-derived behavioral signals
log_reviews and last_ts standardized with StandardScaler. Positive interactions = rating ≥ 4.
ML Framework
PyTorch 2.x CUDA GPUData
pandas numpyFeatures
scikit-learn TF-IDF+SVDOptimizer
Adam + BPR CosineAnnealingLR