We're looking for a talented Senior Software Engineer to join the Identity Data team. The team is in charge of the critical data pipeline that processes and aggregates the huge quantity of events that, over time, build our identity data corpus.
Why should you join us?
The Identity Data team is the place for those who are passionate about data and scale.
We handle >2B person entities stored jointly in various database engines to support different fetching requirements, while processing >200k events per second
The team provides services to all our decisions, generating some of the most critical model inputs
We are one of the major contributors to our decision latency and cloud expense, and as such we are always improving our engines, reducing cost and latency, and have the organizational credit to make high-risk / high-reward investments in cutting edge technologies
If you are up for the challenge of being in one of the most critical teams in the organization, which holds part of the secret sauce of our success, we are the place for you!
What you'll be doing:
Have a key part in improving the systems helping us prevent fraud for our merchants by identifying millions of decisions a day
Design, build, and maintain Near-real-time data pipelines
Take initiative: A large portion of the work are initiatives are pushed by the engineers
Work in a rich ecosystem of storage and data technologies, to name a few: Elasticsearch, Aerospike, Spark, Redis, RocksDB, S3, Snowflake and many more
Provide software solutions to complicated challenges we face on a daily basis.
Requirements: 6 + years of proven experience in designing and building large-scale production systems.
Strong Java and/or Python skills.
Experience (or good familiarity) with streaming technologies such as Storm, Flink, Spark Streaming.
Familiarity with concepts such as Queue/DLQ, Topics and general streaming practices (backpressure, circuit-breaker, etc).
Experience with various databases or data stores such as Elasticsearch, Redis, MySQL, Couchbase, MongoDB, Cassandra, Aerospike, etc.
Experience with AWS or other public clouds.
Excellent communication and interpersonal skills.
It'd be cool if you also: [not a must]
Profiled and optimized code to the millisecond level.
Experience with event streaming frameworks.
Strong written-English skills.
This position is open to all candidates.