Cloud Support Engineers in the Data in Transit domains support customers who are running ETL workload or analyzing large amounts of data using AWS services.
As a part of this team, you will be working on a plethora of services such as Glue (ETL service), Athena (interactive query service), Managed Workflows of Apache Airflow, etc.
Understanding of ETL (Extract, Transform, Load) Creation of ETL Pipelines to extract and Ingest data into data lake/warehouse with simple to medium complexity Data transformations and troubleshooting ETL job issues.
Understanding of Linux and Networking concepts.
Excellent oral and written communication skills with multi-tasking ability.
Master's degree in Information Science/Information Technology, Data Science, Computer Science, Engineering, Mathematics, Physics, or a related field OR Bachelor's degree in the same with 1+ year of experience OR equivalent experience in a technical position.
Key job responsibilitiesIntermediate expertise in ETL tools such as Talend, Informatica or similar.Knowledge of data management fundamentals and data storage principles.Advanced SQL and query performance tuning skills.Experience integrating and managing large data sets from multiple sources.Ability to read and understand Python and Scala code.Understanding of distributed computing environments.Proficient in Spark, Hive and Presto.Experience working with Docker.Python, and shell scripting.Customer service experience / strong customer focus.Prior working experience with AWS - any or all of EC2, S3, EBS, Glue, Athena.Experienced with Linux system monitoring and analysis (disk management, memory management, permissions etc.
).Understanding of Networking concepts and protocols (DNS, TCP/IP, DHCP, HTTPS, etc.
).BASIC QUALIFICATIONS- 2+ years of experience in big data/Hadoop with excellent knowledge of Hadoop architecture and administration and support.
- Be able to read Java code, and basic coding/scripting ability in Java, Perl, Ruby, C#, and/or PHP with Databases (MySQL, Oracle, NoSQL) experience.
- Good understanding of distributed computing environments and excellent Linux/Unix system administrator skills.
PREFERRED QUALIFICATIONS- Proficient in Hadoop Map-Reduce and its Eco System (Zookeeper, HBASE, HDFS, Pig, Hive, Spark, etc).
- Good understanding of ETL principles and how to apply them within Hadoop.
- Prior working experience with AWS - any or all of EC2, S3, EBS, ELB, RDS, Dynamo DB, EMR.
Posted: March 5, 2025 (Updated 1 day ago)
#J-18808-Ljbffr