hackerllama
  • About
  • Blog
  • Misc Resources

Blog

A minimal Introduction to Quantization

For the last couple of weeks, I’ve been considering writing some introductory content for quantization. After exploring a bit more, I realized there are many great resources…
Aug 4, 2024

LLM Evals and Benchmarking

You go to Hugging Face, and you see there are 60 thousand text generation models, and you feel lost. How do you get the best model for your use case? How to get started? The…
Mar 10, 2024

Sentence Embeddings. Cross-encoders and Re-ranking

Deep Dive into Cross-encoders and Re-ranking
Jan 20, 2024

The Llama Hitchiking Guide to Local LLMs

Here are some terms that are useful to know when joining the Local LLM community.
Jan 12, 2024

Sentence Embeddings. Introduction to Sentence Embeddings

Everything you wanted to know about sentence embeddings (and maybe a bit more)
Jan 7, 2024

Transformer model from the original “attention is all you need” paper

The Random Transformer

Understand how transformers work by demystifying all the math behind them
Jan 1, 2024

 

The GPU Poor strike back

Some months ago, SemiAnalysis published a flashy article with the premise that organizations with GPUs in the magnitude of tens of thousands had so many resources that the…
Dec 15, 2023
No matching items
  • Report an issue