DESCRIPTION:
Duties: Assemble large data sets that meet functional / non-functional business requirements. Build the infrastructure with help of Terraform required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS Data Analytics Services. Develop applications, which extract data from different internal and external sources, and transform and integrate into the Enterprise Data Lake using AWS Services like AWS Glue and EMR. Leverage Apache Spark's distributed in-memory processing, perform data transformations and ingest cleansed and transformed data into AWS Data Lake. Accommodate analytical layer on top of Data lake for the end users by creating Glue Catalog tables and crawlers. Write automated test cases in Java and python using Junit and Pytest respectively. Utilize knowledge on Jenkinsfile libraries to deploy the applications through CICD. Develop efficient data pipelines and orchestrate them in a workflow for automated runs using AWS Step Function. Implement Java Spring application in AWS Lambda to extract data from AWS S3 and stream the data to write to a Kafka topic.
QUALIFICATIONS:
Minimum education and experience required: Bachelor's degree in Electronic Engineering, Computer Science, Computer Engineering, Computer Information Systems, Information Technology, or related field of study plus 5 years (60 months) of experience in the job offered or as a Software Engineer, Senior Consultant, Senior Software Developer, Senior Associate (Projects), or related occupation.
Skills Required: Requires experience in the following: Linux; Unix; Windows; Agile SDLC; Application Architecture Disciplines; Data Architecture Disciplines; Microservices; Apache Kafka; Docker; J2EE; Jenkins; Spring; Java; Javascript; JQuery; Python; Shell Scripting; SQL; XML; Apache Tomcat; Bootstrap; REST; SOAP; Maven; JSON; Kubernetes; Apache Zookeeper; AWS Cloud Services; Dynatrace; Cassandra; OpenShift; Spring AOP; Spring Security; TypeScript; Apache Camel; JProfiler; Hadoop; Hive; Oracle; DB2; Apache Spark; Splunk; GIT; Cucumber; Junit; Performance Testing; System Integration Testing; Unit Testing; and PCF.
Job Location: 8181 Communications Pkwy, Plano, TX 75024