Marcos Huerta
Home
About Me
Posts
Projects
Posts: Data is Singular
Occasional posts about code, data science, and machine learning.
Live Semantic Search with DuckDB, BERT, and LLama.cpp
When searching a set of documents there are, in general, two kinds of search: lexical search and semantic search. Lexical searches match vocabulary or words. Search for…
Mar 5, 2024
(Almost) A Year of EV Driving Data
In March we purchased a (used) 2022 Kia EV6. We have the rear-wheel-drive model, which means better range/efficiency but no heat pump: in the winter it uses a…
Dec 31, 2023
Posit Conference 2023
These are “talk notes” for my 2023 Conference talk.
Sep 17, 2023
All That Whispers
I wanted to summarize the various projects and links related to OpenAI’s Whisper transcription software and model. Here is what I have discovered and noticed, collected into…
Jan 19, 2023
Streamlit or Dash (or Shiny for Python?!)
I virtually attended part of the Rstudio conference, more specifically I watched a few streams and also participated a bit on their neat conference Discord server.
Jul 31, 2022
Wordle is Fun (for programmers and data scientists)
So, yes obviously wordle is a fun game to play, but I refer actually to the multitude of interesting programming and analytical efforts that have come out of the game. Many…
Mar 6, 2022
Advent of Code 2021
I … think that was fun? I didn’t finish. The holidays and holiday travel completely thwarted attempts on the 23rd-25th and now I find myself demotivated and intimidated by…
Dec 28, 2021
Deploying Uwsgi to host Dash apps
When I set up this site, I wanted to host web apps, not just static pages, and my initial goal was a Shiny Server. While somewhat challenging, the good news was that once it…
Aug 22, 2021
Converting Garmin GPX files to Pandas and CSV
When I started running with an Apple Watch, I learned about the excellent HealthFit app which, in addition to syncing to a large number of run tracking sites like Strava and …
Aug 14, 2021
No matching items