ForeverYoung

Low-Dimensional Representations: Projection, MRL, and Sparse Representations

27 May, 2026

In large-scale retrieval systems, embedding cost comes from model inference, vector storage, memory bandwidth, and KNN search. This post compares projection, MRL, and CSR-style sparse representations.

A Deep Dive into Sparse Matrices in PyTorch 2.12: COO, CSR, CSC, BSR, and BSC

25 May, 2026

A look at COO, CSR, CSC, BSR, and BSC in PyTorch 2.12: how sparse matrices are stored, how multiplication is routed, and what one CPU/GPU benchmark says about storage and speed ratios.

Think Before You Embed

17 May, 2026

Production search systems rewrite queries with LLMs before embedding them. Two ICLR 2026 papers ask what happens when elaboration and embedding share a model and a gradient.

Data Visualization with Hand-Drawn/Sketchy Style

18 Apr, 2022

A survey of tools for creating hand-drawn/sketchy style data visualizations: rough.js, draw.io, matplotlib xkcd, chart.xkcd, and cutecharts.

Model Size vs. Inference Speed in Deep Learning

4 Mar, 2022

An examination of how FLOPs, parameter count, memory access volume, and memory footprint affect inference speed, with practical network design recommendations for different hardware platforms.