Thoughts on language, markets, numerical computation, among other things.
Comparing single-shot LLM paper selection against multi-run consensus voting.
When evaluating LLMs, the order you present data matters more than you’d think. I found that models consistently favor the second option when comparing summaries—unless you make them think first.
Long documents don’t fit in context windows. Here’s a practical chunking approach that iteratively distills text into concise notes, preserving what matters.
Quantization saves memory, but does it hurt quality on smaller 7B models? I ran the same summarization task across Q3, Q4, Q5, and Q8 variants. The differences were smaller than expected.
There’s drama on HuggingFace about data contamination in top-ranked models. I’ve been using una-cybertron-7B-v2 for summarization—time to take a closer look at what’s actually going on.
Q4_K_M, Q5_K, Q8_0—what do these cryptic codes mean? A quick reference for picking the right quantization method when running LLMs locally.
Does GPT-4 justify its cost for domain-specific tasks? I test both GPT-3.5 and GPT-4 on financial news sentiment classification to find out.
We talk about addiction in terms of substances, but the real dependencies run deeper. Some thoughts on the attachments we rarely examine.
A distillation of evidence-based longevity practices: exercise, sleep, diet, supplements. No magic bullets—just what the research actually supports.
Cholesky decomposition breaks a matrix into triangular parts. Useful for solving linear systems, simulating correlated variables, and speeding up optimization.
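The decomposition and the correlated-sampling trick mentioned above can be sketched in a few lines of NumPy (the matrix values here are just illustrative):

```python
import numpy as np

# A symmetric positive-definite covariance matrix (illustrative values)
cov = np.array([[4.0, 2.0],
                [2.0, 3.0]])

# The Cholesky factor L is lower-triangular and satisfies L @ L.T == cov
L = np.linalg.cholesky(cov)

# Simulating correlated variables: push independent standard normals
# through L, and the resulting samples have covariance ~cov
rng = np.random.default_rng(0)
z = rng.standard_normal((2, 10_000))
x = L @ z
print(np.cov(x))  # close to cov
```

The same factor also speeds up solving `cov @ v = b`: two triangular solves instead of a general one.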
Word embeddings turn text into vectors that encode meaning. I walk through building one from scratch using TensorFlow and the Gutenberg Encyclopedia.
Neural networks are powerful but expensive. For simpler image tasks, PCA can get you surprisingly far. Here I classify landscapes vs impressionist paintings with just eigenvectors.
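A minimal sketch of the eigenvector approach, on synthetic stand-in data rather than real images (the two classes and the nearest-centroid classifier are assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(42)
# Stand-ins for flattened image vectors: two classes with different structure
class_a = rng.normal(0.0, 1.0, (50, 64)) + np.linspace(0, 3, 64)  # "landscapes"
class_b = rng.normal(0.0, 1.0, (50, 64)) - np.linspace(0, 3, 64)  # "paintings"
X = np.vstack([class_a, class_b])
y = np.array([0] * 50 + [1] * 50)

# PCA via eigendecomposition of the covariance matrix
Xc = X - X.mean(axis=0)
eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
top = eigvecs[:, np.argsort(eigvals)[::-1][:5]]  # top 5 eigenvectors

# Project onto the eigenvectors, then classify by nearest class centroid
Z = Xc @ top
centroids = np.array([Z[y == c].mean(axis=0) for c in (0, 1)])
pred = np.argmin(((Z[:, None, :] - centroids) ** 2).sum(-1), axis=1)
print("train accuracy:", (pred == y).mean())
```

No gradient descent, no GPU: just a covariance matrix and its eigenvectors.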
FML posts are depressing; JustMadeMyDay posts are uplifting. I build a Naive Bayes classifier to tell them apart—and see which words carry the most weight.
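The core of that classifier fits in a few lines: Laplace-smoothed word likelihoods per class, and a per-word log-odds ratio to see which words carry the most weight (the toy corpus below is invented for illustration):

```python
from collections import Counter
import math

# Tiny stand-ins for FML (negative) and JustMadeMyDay (positive) posts
neg_docs = ["my car broke down again", "i failed the exam today"]
pos_docs = ["my friend surprised me with cake", "i passed the exam today"]

def word_counts(docs):
    c = Counter()
    for d in docs:
        c.update(d.split())
    return c

neg, pos = word_counts(neg_docs), word_counts(pos_docs)
vocab = set(neg) | set(pos)

def log_likelihood(word, counts):
    # Laplace-smoothed log P(word | class)
    total = sum(counts.values())
    return math.log((counts[word] + 1) / (total + len(vocab)))

# Per-word weight: log-odds ratio between the two classes
weights = {w: log_likelihood(w, pos) - log_likelihood(w, neg) for w in vocab}
for w in sorted(weights, key=weights.get, reverse=True)[:3]:
    print(w, round(weights[w], 2))
```

Words that appear only in one class end up with the largest absolute weights, which is exactly what the post digs into.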
I had 1,363 Bukowski poems lying around from a neural network project. Might as well run sentiment analysis and see what themes emerge from all that beautiful misery.