Tag: performance
All articles tagged "performance".
-
Low-Dimensional Representations: Projection, MRL, and Sparse Representations
In large-scale retrieval systems, embedding cost comes from model inference, vector storage, memory bandwidth, and KNN search. This post compares projection, MRL, and CSR-style sparse representations.
-
A Deep Dive into Sparse Matrices in PyTorch 2.12: COO, CSR, CSC, BSR, and BSC
A look at COO, CSR, CSC, BSR, and BSC in PyTorch 2.12: how sparse matrices are stored, how multiplication is routed, and what one CPU/GPU benchmark says about storage and speed ratios.