How To Migrate Multi-Terabyte Datasets to a PGVector DatabaseLessons on Big Data learned through trial and errorJun 27Jun 27
Published inStackademicSpark Performance Tuning for BigQuery APIsAt the end of 2023, my team was given an important initiative — run a Natural Language Processing (NLP) algorithm on about a billion pages…Mar 26Mar 26
Published inStackademicReducing BigQuery Costs by 100–200x with dbt Incremental ModelsMany models my team works with at Tempus are large, but not “big data” large. Usually, our tables hover around the hundred-million rows…Feb 72Feb 72
Several Examples of Why AI Is Not Going to Replace Our JobsAnytime soon at least. Disclaimer: this article is not serious.Jan 31Jan 31
Published inStackademicThree Key BigQuery Optimizations We Should All Be UsingDuring a recent BigQuery debugging session, I dove deep into BigQuery and learned a few important lessons that I believe are not well-known…Jan 94Jan 94
Debug Diaries: duplicate records in dbt SnapshotsHi, my name is Noah — I’m a Senior Data Engineer at Tempus AI. I write about the tech challenges I face with the hope of creating that…Dec 14, 2023Dec 14, 2023
Published inStackademicSQL and dbt 101I volunteer on the Tempus Board of Education (TBOE) in my free time. TBOE is a group that aims to provide Tempus employees with courses and…Nov 20, 20233Nov 20, 20233
Elementary on dbt — An OverviewIf you’ve read my previous articles, you’ll know that I’m a dbt evangelist and a huge advocate of the importance of proper testing…Nov 8, 2023Nov 8, 2023
Four Simple Frameworks to Improve Communication Skills as an EngineerFrameworks to help you be a more effective communicator, and therefore a more effective Individual Contributor (IC).Oct 31, 20231Oct 31, 20231
Three dbt Macros I Use Every DayLast month my team here at Tempus built out a new data mart using our normal tech stack of SQL + AirFlow + dbt in GCP, and I realized that…Jul 20, 20231Jul 20, 20231