Scala,<br /> Scala,<br /> Our hearts are with Scala<br /> <br /> Scala is the core tech behind our big data pipelines. These pipelines dig deep into online news, blogs, social media, TV radio and traditional print from nearly all countries on the planet. We receive double-digit millions articles, posts and videos each day, with over 100 data points for each media element and getting this fed in hundreds of languages.
This position is in the Data & AI team. Due to its strategic nature, it's hands-on led by the CTO directly. You'll be part in executing and shaping Hypefactor's data & AI roadmap through this daily interaction with senior C-level management.
You’ll work with other specialists, each an expert in a facet of either data engineering and/or AI. You’ll also regularly touch base with backend engineers to align the data pipelines with the product. Occasionally you’ll collaborate with our account managers to build data integrations with external data partners.
Your Responsibilities
Participate in business requirements elucidation with technical feasibility assessments
Hands-on development of batch pipelines of media data with publishers
Hands-on development of concurrent & distributed streaming pipelines, such as Hypefactor’s in-house web crawling cluster and enterprise firehoses of social media platforms like Twitter and Facebook
Conduct reliability assessments of data-pipelines
Build data quality monitoring systems
DevOps of said data-pipelines in Google Cloud
Coordinate with backend engineers its integration into the Hypefactors applications
Your Basic Qualifications
Deep know-how of Scala
Plan & execute software development projects
Build IO-heavy concurrent software architectures around multi-threading, actors, tasks and other asynchronous/concurrency computing concepts
Architecting, querying, indexing & optimising NoSQL databases
Process big data in a distributed manner with a modern big data framework, like Apache Beam
Utilize cloud computing and services, such as Google Cloud
Your Desirable Qualifications
Monix
Akka streams
ElasticSearch
Apache Beam
Google Dataflow
Google PubSub
Your Application
Please address and tailor your resume to Dr. Viet Yen Nguyen, CTO. Our hiring process generally proceeds in three phases: orientation-phase with at least two interviews, followed by a testing-phase with a coding, aptitude and personality tests and concluded by an offer-phase with an overall evaluation.
Start date: December 2019 or January 2020
IMPORTANT: please include the SHA1 hash of the string "We breath Scala" in the short message. Our system prefilters applications that include the correct SHA1 hash.
This job comes with several perks and benefits