Job was saved successfully.
Job was removed from Saved Jobs.
Data Engineer- Thoughtworks (2247071951)
The role is going to be Data Engineer with our client Thoughtworks. Please find below the job description for the position. Please send the following documents to if that interests you and matches your profile. Without mandatory documents, we cannot submit a candidate. Updated Resume in word format (Mandatory) Expected hourly rate (Mandatory) Kindly ignore if this requirement does not match your current or preferred job profile, it would be really appreciated if you can refer any of your friends/colleagues. Duration : 6 months Contract with a possibility of extension. Job Description: Data Engineers develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions. You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems. On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product. It could also be a software delivery project where you're equally happy coding and tech-leading the team to implement the solution. You’ll spend time on the following: You will partner with teammates to create complex data processing pipelines in order to solve our clients’ most ambitious challenges You will collaborate with Data Scientists in order to design scalable implementations of their models You will pair to write clean and iterative code based on TDD Leverage various continuous delivery practices to deploy, support and operate data pipelines Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available Develop and operate modern data architecture approaches to meet key business objectives and provide end-to-end data solutions Create data models and speak to the tradeoffs of different modeling approaches Seamlessly incorporate data quality into your day-to-day work as well as into the delivery process Here’s what we’re looking for: You have a good understanding of data modelling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting Hands on experience in MapR, Cloudera, Hortonworks and/or cloud (AWS EMR, Azure HDInsights, Qubole etc.) based Hadoop distributions You are comfortable taking data-driven approaches and applying data security strategy to solve business problems You’re genuinely excited about data infrastructure and operations with a familiarity working in cloud environments Working with data excites you: you can build and operate data pipelines, and maintain data storage, all within distributed systems Assure effective collaboration between ThoughtWorks’ and the client’s teams, encouraging open communication and advocating for shared outcomes Powered by JazzHR z7vSNCjhwx