Data Engineering Specialist Needed

Closed Posted last week Paid on delivery
Closed Paid on delivery

I'm looking for a data engineer with solid Pyspark knowledge to assist in developing a robust data storage and retrieval system, primarily focusing on a Data Warehouse.

Key Responsibilities:

- Implementing efficient data storage solutions for long-term retention and retrieval

- Ensuring data quality and validation procedures are in place

- Advising on real-time data processing capabilities

Ideal Candidate:

- Proficient in Pyspark with hands-on experience in data storage and retrieval projects

- Familiar with Data Warehousing concepts and best practices

- Able to recommend and implement appropriate real-time processing solutions

- Strong attention to detail and commitment to data quality.

Specifically, I have a Jira ticket that consists of creating an application that runs on Airflow and connects to an API with survey metadata with values such as if it was opened, if it was answered, etc, then it should generate a zip file with all the data in jsons and save it in a S3 bucket. Once in the bucket you must save the information in a Hive table.

All the code should be from Pyspark and there is a similar application that saves the raw survey data that you can take as reference.

You need to create the table and generate the code, the end client is Expedia and it should be done using their environment, I would give you the credentials and ask what you need with my colleagues.

If you like we can have a video call to explain everything better.

GitHub Hive PySpark

Project ID: #38088526

About the project

13 proposals Remote project Active last week

13 freelancers are bidding on average $463 for this job

katiegspinkss

With my background as an experienced developer and a passion for data engineering, I believe I'm the ideal candidate for this job. My proficiency in Pyspark and data storage and retrieval projects is evident from my ac More

$500 USD in 7 days
(1 Review)
2.5
ajeshjanardanan

Hi Cecilio C., How are you doing? As a professional mechanical and civil expert with expertise inPySpark, GitHub and Hive, I eagerly anticipate the opportunity to complete this project for you. Please drop me a message More

$250 USD in 3 days
(0 Reviews)
0.0
LiveExperts

Hi there,I'm biddin on your project "Data Engineering Specialist Needed"GitHub, Hive and PySpark I'm looking for a data engineer with solid Pyspark knowledge to assist in developing a robust data storage and retrieval More

$750 USD in 5 days
(0 Reviews)
0.0
chatgptsuperpow8

As a Senior Full Stack Engineer and Team Lead, my primary objective has always been to provide superior service while maintaining cost-efficiency. With 12 years of successful projects under my belt, including numerous More

$250 USD in 4 days
(0 Reviews)
0.0
Eddiie420

As a Data Engineering Specialist, I bring over a decade of diverse programming expertise to the table, including my adeptness in Pyspark - the tool you've specifically emphasized. I have successfully implemented data s More

$250 USD in 4 days
(0 Reviews)
0.0
intelliresponse

As a Senior Full Stack Engineer with over 12 years of experience, I have inevitably grown to become highly proficient in a multitude of tools and technologies that are relevant to your project, including PySpark and Gi More

$250 USD in 3 days
(0 Reviews)
0.0
paul396

Pyspark + Datawarehousing Expert is here. Please message me.

$500 USD in 7 days
(0 Reviews)
0.0
souyah

Hi I am a Data Engineer having strong experience in PySpark. Hope to discuss the project details. Thank you

$500 USD in 7 days
(0 Reviews)
0.0
bhupen4work

Hello, I have carefully read your project requirement. With my proficient in Pyspark, I am sure, I can fulfill your requirement and get desired results. I am ready to start the project right away. Regards, Bhupendra

$500 USD in 7 days
(0 Reviews)
0.0
imumermalik

I am confident in my ability to develop a robust data storage and retrieval system using Pyspark for a Data Warehouse project. My solution includes implementing data quality checks, real-time processing, seamless integ More

$675 USD in 5 days
(0 Reviews)
0.0
datacode0023

Hi, I have +7 years of experience dealing with machine learning algorithms and worked on multiple projects in this field, Please contact me to discuss more. Have a nice day

$500 USD in 7 days
(0 Reviews)
0.0
Neerajkkk13

Hi there, I am a data engineer having 8 years of experience in data engineering, data warehousing, ETL using technologies like Python, Pyspark, SQL, AWS services, RDBMS etc. I have worked on different kind of problems/ More

$600 USD in 7 days
(0 Reviews)
0.0