We are currently seeking a Senior Software Engineer to join our growing Data team! Reporting to the Engineering Manager, you will evolve our data models, operationalize production-grade data pipelines, and contribute to our document mining/information retrieval initiatives as we expand our ability to extract valuable insights from scientific publications and databases.
On the BenchSci Data team, you will work with a group of talented and passionate engineers as you play a key role in shaping the future of scientific discovery and progress.
You Will:
- Scale data pipelines to allow our data to go from research to platform as fast as possible
- Collaborate with ML and Data engineers to solve complex document mining challenges, enabling us to capture and model additional scientific experiments
- Develop data access mechanisms for consumption by downstream applications
- Expand the types of biological reagents and experiments that we can successfully extract from various types of media
- Manage sources that contain both semi-structured and unstructured data
- Apply state-of-the-art data processing engines to rapidly process our data sets
- Employ best practices for a broad platform of technologies within a cloud-based environment
- Provide troubleshooting analysis and resolution in a timely manner
- Work on projects involving some of the largest pharmaceutical companies in the world
- Solve difficult problems and bring new perspectives to the team
You Have:
- Degree in Software Engineering, Computer Science, or a similar area
- 4+ years of industry experience as a professional software or data engineer
- Expertise with Python and SQL
- Experience with Retrieval Augmented Generation (RAG)
- Experience with cloud development (we use GCP and Terraform), including reference architectures and developing specialized stacks on cloud services
- A strong background in data structures and large-scale data manipulation
- Strong cross-team communication and collaboration skills