Power of Parquet: Improving efficiencies using the Apache Ecosystem
Regular talk, 3:40 – 4:40 PM
This talk introduces the Apache data storage and access ecosystem and demonstrates the benefits of utilizing it in R workflows. The talk begins by an explanation of the Apache ecosystem's unique approach to data storage and the role each each component of the ecosystem in handling data storage. I then demonstrate how using these elements in combination can improve efficiency in data storage, read time and tabular and geospatial analysis resulting in faster, cleaner workflows.
![]() |
Pronouns: he/himPortland, OR, USAAditya performs economic policy analysis for projects related to housing, transportation, and economic development at ECOnorthwest, a policy consultancy based in Portland. Specializing in urban economics, demography and data science, Aditya oversees technical workflows involving econometric modeling, spatial analysis, transportation modeling, and more. |
