data & machine learning

Things I’ve had fun learning and exploring.

Title Art

Ulam Spiral from Patterns of the Universe. Starting with 1 in the center, each square in a 20x20 grid is numbered in a counter-clockwise spiral. I chose to color all primes red, then even numbers true blue, ones divisible by 3 avocado green, and so on. Colors by me.

Big Data

Store ALL the Data
I had a great time designing and building a Distributed File System (DFS) with a team and wanted to share some of the things I learned.
Split it. Map it. Send it. Count it!
With a team, I explored concurrency and synchonization by creating a MapReduce application. It’s a simple use case - count the number of occurrences of unique words - but requires muilti-threading, locks, and a barrier to get it right.
‘Bloom’ing Efficient Filter
What if we want to count a small subset of words instead of all words? What if our small subset is still a lot of words? Bloom filters are commonly used for quick filtering with a reduced memory footprint.

Resources

Tools for Adding Content
Everything I found helpful in building and adding to this site.