Thought Eddies

2024-07-16

VLMs aren't blind

I attempted to reproduce the results for one task from the VLMs are Blind paper. Specifically, Task 1: Counting line intersections. I ran 150 examples of lines generated by the code from the project with line thickness 4.

vlms claude-3.5-sonnet

2024-07-12

Challenges and Opportunities of the Impact of Language Models on Software Engineering

I'm trying something a bit new, writing some of my thoughts about how the future might look based on patterns I've been observing lately.

language_models thoughts

2024-07-06

Claude Artifacts

I spent some time working with Claude Artifacts for the first time. I started with this prompt I want to see what you can do. Can you please create a 2d rendering of fluid moving around obstacles of different shapes?

claude artifacts claude-3.5-sonnet

2024-06-23

Claude 3.5 Sonnet Codes Really Well

One of my favorite things to do with language models is to use them to write code. I've been wanting to build a variation on tic-tac-toe involving a bit of game theory. I called it "Tactic". I wasn't even really sure if the game would be any more interesting than tic-tac-toe itself, which reliably...

claude-3.5-sonnet tactic

2024-06-18

Language model-based aggregators

Model-based aggregators

language_models aggregators

2024-06-13

Learning How to Learn

I completed Barbara Oakley's "Learning How to Learn" course on Coursera. The target audience seems to be students, but I found there were helpful takeaways for me as well, as someone who is a decade out of my last university classroom.

learning productivity coursera

2024-06-05

Switching From Pocket to Raindrop for bookmarks

I've been using Pocket for a long time to keep track of things on the web that I want to read later. I save articles on my mobile or from my browser, then revisit them, usually on my desktop. Some articles I get to quickly. Others remain in the stack for a long time and can become...

pocket raindrop bookmarks

2024-05-15

Evals: unit testing for language models

Generative AI and language models are fun to play with but you don't really have something you can confidently ship to users until you test what you've built.

evals language_models

2024-01-31

Language Model Streaming With SSE

OpenAI popularized a pattern of streaming results from a backend API in realtime with ChatGPT. This approach is useful because the time a language model takes to run inference is often longer than what you want for an API call to feel snappy and fast. By streaming the results as they're produced,...

sse vercel language_models python fastapi

2024-01-21

Sandboxed Python Environment

Disclaimer: I am not a security expert or a security professional.

language_models python security nix docker

Posts

Keyboard Shortcuts

Global

Navigation