Why this topic?
When we join two flat files in Python, or a flat file to the output of a SQL query, the data probably comes from two different sources. We therefore can’t assume that the two were designed to be combined directly.
What it covers:
I walk through two examples of joining data from completely different sources. In both cases, I first have to understand each dataset’s underlying structure and level of aggregation, then reshape the data so that rows map accurately between the DataFrames.
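To make that workflow concrete before the examples, here is a minimal pandas sketch. The data, column names (`region`, `order_date`, `amount`, `target`), and the daily-versus-monthly mismatch are all made up for illustration and are not taken from the examples that follow. The idea is to normalize the join keys, aggregate the finer-grained table to the coarser one's level, and then merge with checks switched on.

```python
import pandas as pd

# Hypothetical flat-file data: one row per order (daily grain).
orders = pd.DataFrame({
    "region": [" North", "north", "South", "South"],
    "order_date": ["2024-01-05", "2024-01-20", "2024-01-11", "2024-02-02"],
    "amount": [100.0, 50.0, 75.0, 25.0],
})

# Hypothetical SQL query output: one row per region per month.
targets = pd.DataFrame({
    "region": ["NORTH", "SOUTH", "SOUTH"],
    "month": ["2024-01", "2024-01", "2024-02"],
    "target": [120.0, 80.0, 30.0],
})

# Step 1: normalize the join keys so both sources spell them the same way.
orders["region"] = orders["region"].str.strip().str.upper()
orders["month"] = pd.to_datetime(orders["order_date"]).dt.strftime("%Y-%m")

# Step 2: aggregate the daily rows up to the monthly grain of `targets`.
monthly = orders.groupby(["region", "month"], as_index=False)["amount"].sum()

# Step 3: merge with safeguards.
# validate raises an error if either side has duplicate keys;
# indicator adds a "_merge" column showing which side each row came from.
merged = monthly.merge(
    targets,
    on=["region", "month"],
    how="outer",
    validate="one_to_one",
    indicator=True,
)

# Rows that exist in only one source point to a key or grain mismatch.
unmatched = merged[merged["_merge"] != "both"]
print(merged)
print("Unmatched rows:", len(unmatched))
```

Using an outer join with `indicator=True` is a deliberate choice here: an inner join would silently drop rows that fail to match, whereas this version keeps them visible so the mismatch can be investigated.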