This post covers:
- how to extract tabular data from a PDF file using the tabula library
- how to transform and clean it
- how to load it into Snowflake: create a temporary table, upload the CSV file to a storage location such as S3, copy the data into the temporary table, and finally insert the data from the temporary table into the target table
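
As a rough preview of the load step in the last bullet, here is a minimal sketch of the three SQL statements it involves, built as plain strings in Python. The table names `sales` and `sales_tmp`, the stage path `@my_s3_stage/sales.csv`, and the helper `build_load_statements` are all hypothetical names chosen for illustration. The sketch also assumes the CSV has a header row and that a Snowflake stage pointing at the S3 location already exists.

```python
def build_load_statements(target_table, temp_table, stage_path):
    """Return the SQL for the temp-table load pattern, in execution order.

    All arguments are hypothetical, trusted identifiers; they are
    interpolated directly, so never pass untrusted input here.
    """
    return [
        # 1. Temporary table with the same columns as the target table.
        #    Snowflake drops it automatically when the session ends.
        f"CREATE TEMPORARY TABLE {temp_table} LIKE {target_table}",
        # 2. Bulk-load the uploaded CSV from the stage into the temporary table.
        #    SKIP_HEADER = 1 assumes the file starts with a header row.
        f"COPY INTO {temp_table} FROM {stage_path} "
        "FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1)",
        # 3. Move the rows from the temporary table into the target table.
        f"INSERT INTO {target_table} SELECT * FROM {temp_table}",
    ]


statements = build_load_statements("sales", "sales_tmp", "@my_s3_stage/sales.csv")
for statement in statements:
    print(statement + ";")
```

In practice each statement would be passed, in order, to a database cursor's `execute` method within a single session, since the temporary table is only visible to the session that created it.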