latest
Aug
12
The Convergence of Proprietary and Open Source LLMs
Open and private models are becoming more similar than they are different
6 min read
Apr
26
How to Beat Proprietary LLMs With Smaller Open Source Models
Building your AI applications around open source models can make them better, cheaper, and faster
14 min read
Apr
08
A Guide to Structured Outputs Using Constrained Decoding
The how, why, power, and pitfalls of constraining generative language model outputs
14 min read
Jul
22
Modern Data Engineering and the Lost Art of Data Modelling
Necessity was the mother of invention. Now, an abundance of cheap storage and compute makes for data anarchy.
5 min read
Jun
23
Machine Learning in the Life Sciences Has a Data Problem
In a time of AI prosperity, the life sciences are at risk of being left behind
6 min read
Jun
07
Approximating Shapley Values for Machine Learning
The how and why of Shapley value approximation, explained in code
6 min read
Apr
07
Gnillehcs' Model of Integration
What happens to segregated communities as people increasingly seek diversity?
3 min read
Dec
31
How Shapley Values Work
In this article, we will explore how Shapley values work - not using cryptic formulae, but by way of code and simplified explanations
10 min read
Aug
13
Industry Perspective: Tree-Based Models vs Deep Learning for Tabular Data
Tree-based models aren't just highly performant - they offer a host of other advantages
3 min read
Jul
11
4 Pandas Anti-Patterns to Avoid and How to Fix Them
pandas is a powerful data analysis library with a rich API that offers multiple ways to perform any given data
9 min read