Penguin Engineering

Tech tidbits for data professionals

data wrangling

A collection of 2 posts

August 1, 2021

Robust Pandas pipelines

Recently I built a data pipeline in Pandas that would be run on a weekly basis.

Python, data wrangling, pipelines, reproducible

June 30, 2021

PySpark data trap - inferschema

PySpark is the Python API wrapper for Apache Spark, a big data processing framework.

Python, PySpark, data wrangling

Penguin Engineering © 2023. Royce theme by JustGoodThemes.
Powered by Jekyll.
