AWS Data Wrangler

AWS Labs has a lot of open source code up on GitHub. AWS Data Wrangler is a bridge between Python pandas DataFrames and AWS data services.

In this post, we're taking a look at AWS Data Wrangler. This project provides a smoother interface between Python pandas DataFrames and various AWS Cloud data services.

I call out a few more details in the Twitter thread below…

Tweet 1/8

AWS Data Wrangler, https://github.com/awslabs/aws-data-wrangler, is an interesting project from the @awscloud ProServe team. it aims to connect python pandas DataFrames to various AWS services 🧵☁️ #cloud #devops

Tweet 2/8

this thread (unrolled) is up at https://t.co/v1ZIBgC6W3 and the last thread is up at https://markn.ca/2021/amazon-api-gateway-serverless-developer-portal/ 🧵☁️ #cloud #devops

Tweet 3/8

if you've been anywhere near a data science project, you've probably seen either the scipy, numpy, or pandas projects in python...or all three. they are awesome. one of the fundamental data structures in pandas is the DataFrame, https://pandas.pydata.org/pandas-docs/stable/user_guide/dsintro.html 🧵☁️ #cloud #devops

Tweet 4/8

the AWS Data Wrangler project lets you save DataFrames to various @awscloud data services. this could save a ton of time for your python projects 👇 🧵☁️ #cloud #devops
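
As a rough sketch of what that save-and-load round trip might look like, the snippet below writes a small DataFrame to Amazon S3 as a Parquet dataset with `wr.s3.to_parquet` and reads it back with `wr.s3.read_parquet`. The helper names, the bucket name, the prefix, and the sample data are all placeholders made up for illustration. Running `round_trip` needs `pip install awswrangler pandas`, valid AWS credentials, and a bucket you can write to, so those imports are deferred until the function is called.

```python
def s3_path(bucket: str, prefix: str) -> str:
    """Build an s3:// folder URI from a bucket name and key prefix (pure string helper)."""
    return f"s3://{bucket}/{prefix.strip('/')}/"


def round_trip(bucket: str, prefix: str):
    """Write a small DataFrame to S3 as a Parquet dataset, then read it back.

    Assumes valid AWS credentials and an existing, writable bucket.
    """
    # Imported lazily so s3_path() works even without these packages installed.
    import awswrangler as wr
    import pandas as pd

    # Made-up sample data, purely for illustration.
    df = pd.DataFrame({"id": [1, 2, 3], "name": ["a", "b", "c"]})
    path = s3_path(bucket, prefix)

    # dataset=True treats the path as a folder of Parquet files;
    # mode="overwrite" replaces whatever is already under that prefix.
    wr.s3.to_parquet(df=df, path=path, dataset=True, mode="overwrite")

    # Read every Parquet file under the prefix back into one DataFrame.
    return wr.s3.read_parquet(path=path, dataset=True)
```

Calling something like `round_trip("my-example-bucket", "wrangler-demo")` (a hypothetical bucket) should hand back a DataFrame with the same three rows, though row order is not guaranteed.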

Tweet 5/8

the repo has a broad spectrum of samples, all in @ProjectJupyter notebooks. I ❤️ that because it makes it easier to play with the code. find the tutorials at https://github.com/awslabs/aws-data-wrangler/tree/main/tutorials 🧵☁️ #cloud #devops

Tweet 6/8

this project installs via standard pip but is also available as a Lambda layer, in the AWS Glue shell, in @awscloud SageMaker Notebooks, and more. that flexibility is much appreciated 🧵☁️ #cloud #devops
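
For the pip route, the package is published as `awswrangler` (so `pip install awswrangler`). Since the library can arrive in several ways (pip, a Lambda layer, a Glue or SageMaker environment), a small guard like the one below is one way to check whether it is importable before relying on it. The function name is made up for this sketch, which uses only the standard library.

```python
import importlib.util


def is_installed(module_name: str = "awswrangler") -> bool:
    """Return True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(module_name) is not None
```

In a Lambda function or notebook, `is_installed()` returning `False` would suggest the layer or package has not been attached to that environment yet.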

Tweet 7/8

this library does more than just save and load data. the full API for it is up at https://aws-data-wrangler.readthedocs.io/en/2.12.1/api.html and there are a lot of very useful data manipulation functions here 🧵☁️ #cloud #devops
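
As one example of going beyond plain save and load, the library can run a SQL query on Amazon Athena and return the result as a DataFrame through `wr.athena.read_sql_query`. The sketch below is hedged: the helper names, table, column, and database are hypothetical, the query builder does no real identifier escaping, and the call needs AWS credentials plus Athena and Glue permissions in your account.

```python
def count_by_sql(table: str, column: str, limit: int = 10) -> str:
    """Build a simple GROUP BY query string (hypothetical helper, no real escaping)."""
    return (
        f'SELECT "{column}", COUNT(*) AS n '
        f'FROM "{table}" GROUP BY "{column}" '
        f"ORDER BY n DESC LIMIT {int(limit)}"
    )


def query_to_frame(database: str, table: str, column: str):
    """Run the query on Amazon Athena and return the result as a pandas DataFrame."""
    # Deferred import: needs `pip install awswrangler` and AWS credentials.
    import awswrangler as wr

    return wr.athena.read_sql_query(
        sql=count_by_sql(table, column),
        database=database,
    )
```

A call such as `query_to_frame("my_database", "sales", "region")` (all placeholder names) would return the most common `region` values as a DataFrame.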

Tweet 8/8

all-in-all, if you're using pandas, scipy, or numpy in your python project and your data is on @awscloud, you'll want to, at the very least, check out the AWS Data Wrangler at https://github.com/awslabs/aws-data-wrangler /🧵☁️ #cloud #devops