Senior Software Engineer, Trino
Treasure Data:
At Treasure Data, we’re on a mission to radically simplify how companies use data and AI to create connected customer experiences. Our intelligent customer data platform (CDP) drives revenue growth and operational efficiency across the enterprise to deliver powerful business outcomes.
We are thrilled that Forrester has recognized Treasure Data as a Leader in The Forrester Wave™: Customer Data Platforms For B2C. It's an honor to be acknowledged for our efforts in advancing the CDP industry with cutting-edge AI and real-time capabilities.
Furthermore, Treasure Data employees are enthusiastic, data-driven, and customer-obsessed. We are a team of drivers—self-starters who take initiative, anticipate needs, and proactively jump in to solve problems. Our actions reflect our values of honesty, reliability, openness, and humility.
Your Role:
The Plazma team at Treasure Data is one of the essential elements of our CDP solution and is part of the Core Services group, which supports customer data ingestion and availability at a rate of 70B records per day. We develop & run the storage and query engine components and enable customers to find and store their data by offering comprehensive solutions based on OSS and proprietary software. You are expected to help the team develop the future of our Trino query engine and expand from there into Hive/Hadoop and our in-house developed storage solution. This includes maintaining technical excellence to address challenges that currently lack industry-wide solutions and delivering the roadmap together with your team. Our team consists of Big Data experts across Japan, Korea and Canada who are passionate about OSS contribution, and we take pride in the quality of service we offer.
Responsibilities & Duties:
Work as a member of the team by designing and developing Trino solutions
Be responsible for providing solution expertise around Trino technologies. This includes technology assessment, use case development, as well as solution outline and design for modern data architectures
Establish standards and guidelines for the design & development, tuning, deployment, and maintenance of advanced data access frameworks and distributed systems
Document architectural and technology advancements
Work with your team to set up the roadmap for Trino-related products based on operational needs and customer-requested features
Mentor and train new members in the team
Manage versions and releases of Trino products
Evaluate, test, and set a base version
Backport needed patches from trunk, which holds the latest version of the project and may therefore also be the least stable
Deploy new customer-facing features for Trino
Coordinate with support and product teams on product releases
Make contributions to the Trino open source community
Automate Trino cluster operations to reduce operational overhead
Design, develop, and evaluate metrics to ensure system health and plan infrastructure capacity of clusters
Design and develop scripts to automatically start and stop clusters and switch traffic between active clusters for load balancing of customers’ workloads
Design and develop failure recovery tools to automatically detect the occurrence of faults and recover faulty clusters
Provide in-depth support services to Trino customers
Take responsibility for on-call to support Trino customers
Handle escalations on product defects and performance issues, and lead and perform in-depth troubleshooting of Trino-related systems
Required Qualifications:
Working proficiency in English
A BS or higher in Computer Science or equivalent experience
At least 5 years’ experience in:
Java development
Operating production-scale deployments
Working with MySQL, PostgreSQL, or other open-source distributed databases/key-value stores
Solid understanding of cloud architecture and services in public clouds like AWS, GCP, or Microsoft Azure
Deep understanding of distributed systems and their challenges
Experience working with distributed, scalable Big Data stores or NoSQL systems such as HDFS, S3, Cassandra, and Bigtable
Experience in developing use cases, functional specs, design specs, ERDs, etc.
A solid understanding of computer science (algorithms, data structures, etc.)
Solid understanding and handling of Big Data problems
Able to work independently as well as in a team
Strong capability in implementing new and improved data solutions for multi-tenant environments
It would be nice if you had:
Deep understanding of the capabilities of Trino
Kotlin and Scala experience
Familiarity with microservices-based software architecture
Expertise in Data Integration patterns
Strong track record of driving rapid prototyping and design for Big Data
Experience with extending Free and Open-Source Software (FOSS) or COTS products
Strong IT & Security skill sets and knowledge
Experience with the design and development of multiple object-oriented systems
Good understanding of ‘infrastructure as code’ and operations
Physical Requirements:
3 days per week at the Treasure Data office
About Treasure Data:
Treasure Data is the Intelligent Customer Data Platform (CDP) built for enterprise scale and powered by AI. Recognized as a Leader by Forrester and IDC, Treasure Data empowers the world’s largest and most innovative companies to deliver hyper-personalized customer experiences at scale that increase revenue, reduce costs, and build trust.
Through unique capabilities such as the Diamond Record, AI Agent Foundry, and AI Decisioning with Real-Time Personalization, Treasure Data enables marketing and CX teams to personalize cross-channel engagement in real-time, optimize marketing spend while increasing ROI, and drive customer lifetime value through more intelligent retention and loyalty.
Our Dedication to You:
We value and promote diversity, equity, inclusion, and belonging in all aspects of our business and at all levels. Success comes from acknowledging, welcoming, and incorporating diverse perspectives.
Diverse representation alone is not the desired outcome. We also strive to create an inclusive culture that encourages growth, ownership of your role, and achieving innovation in new and unique ways. Your voice will be heard, and we will help amplify it.
Agencies and Recruiters:
We cannot consider your candidate(s) without a contract in place. Any resumes received without having an active agreement will be considered gratis referrals to us. Thank you for your understanding and cooperation!
- Department: Engineering | R&D
- Role: Senior Software Engineer
- Locations: Tokyo, Japan
- Remote status: Hybrid
- Employment type: Full-time
- Employment level: Professionals