Woven by Toyota is seeking a Software Engineer - Data Infrastructure in London to power a data platform that accelerates autonomous driving by providing fast, cost effective access to petabytes of vehicle data. You’ll build and operate ingestion pipelines from the fleet, design distributed cloud systems, and collaborate with ML and Computer Vision engineers on high impact projects. Key skills include Python, building reliable data processing apps, RPC with gRPC and protobuf, and hands on cloud experience (AWS, GCP, Azure); experience with Flyte, Kafka, SQS, BigQuery and ElasticSearch is a plus. They value modular, testable code and cross team collaboration. To apply, tailor your resume to data infra, give concrete results, and show a passion for mobility safety. Woven by Toyota is an equal opportunity employer with a privacy notice.
Team
Our data platform team is working on accelerating autonomous driving by providing access to petabytes of data collected by our fleet of autonomous and non-autonomous vehicles. Efficient, fast and cost-effective access to data at a large scale is key to tackle the hardest problems in AD/ADAS, from developing the Machine Learning (ML) models for perception and prediction of human driving patterns, to increasing the sophistication of our validation and simulation by identifying rare and interesting real-world driving situations. The data ecosystem developed by the London team is a key building block for developing and testing modern AD/ADAS products that will impact millions of customers.
Our ML and Data pipelines are built on top of the open-source Flyte orchestration framework and are deployed to AWS. Pipeline code is written in Python. We use SQS and Kafka to automate data connections and leverage BigQuery and ElasticSearch for data storage. We believe strongly in automation and testing to ensure the delivery of robust and correct systems. We are a distributed team, working in the UK and US.
Who are we looking for?
The London Data Infrastructure team is looking for engineers who are passionate about and enable the next generation of automotive software development. The right candidate will have excellent communication skills, solid coding skills, broad knowledge of software development across areas such as Data Infrastructure and Warehouses, Data Ingestion, Compute Frameworks, Observability, and Build Infra.
Work on high-impact projects and innovate new solutions to problems in the self-driving space.
Work with Computer Vision and Machine Learning engineers on high-impact projects and innovate new solutions to problems in the self-driving space.
Understand the complex data requirements of modern ML development and tailor our data ecosystem to these needs.
Build efficient data pipelines for ingestion from the vehicle fleet.
Work on distributed systems that serve, process and transform large quantities of data in the cloud.
Extensive experience in Python (or other object-oriented language).
Experience building reliable, distributed applications for Data Processing or similar areas.
Working with RPC protocols such as gRPC/protobuf.
Hands-on experience developing cloud applications (e.g. AWS, GCP, Azure).
Experience writing testable and modular code.
Experience working in a fast-paced environment, collaborating across teams and disciplines.
Experience designing, deploying, and maintaining distributed systems.
Data pipelines, data platforms, workflow orchestration, batch processing.