
Data Scientist
About the Role
As a Data Scientist on Cyera’s Research team, you’ll develop and productionize ML/AI models that power our classifications and insights. You’ll partner closely with Product and Engineering to turn open-ended data-security challenges into measurable experiments and shipped features. You’ll own the end-to-end lifecycle—from problem framing and data strategy to evaluation, deployment, and ongoing monitoring—helping customers discover, protect, and govern their data at scale.
What You’ll Do
- Responsibility for an end-to-end research process. That includes identifying the problem, feature engineering, model development to deployment, outcome analysis, and refinement.
- You will be a hands-on domain leader, laying the foundations of our data science workflows and algorithms. This is an excellent opportunity to work with endless amounts of user data and creatively generate insights that will increase the ability to classify tons of data.
- Develop, evaluate, and maintain machine learning and NLP solutions to enhance Cyera’s core capabilities in sensitive data classification.
- Innovation and creative thinking are the keys! Implementing ML models to the entire research process – clustering, text extraction, document analysis, and tabular data classification.
- Close interaction and collaboration with an excellent team of engineers, data analysts, and security researchers.
Requirements
Who You Are
Must-Haves
- BSc in computer science, math, physics, or a related field
- 5+ years of experience as a data scientist
- Experience with data pipelines / big-data analytics – Must
- Solid grounding in core machine learning concepts and techniques – classic ML (SVM, trees, bagging/boosting, clustering methods), neural networks (activations, dropout, batch norm), optimization (loss functions, gradient descent, regularization), data & features (imbalance, scaling/encoding, dedup/leakage), evaluation (PR/ROC metrics, ablations, error analysis).
- Demonstrated expertise in applying LLMs – prompt engineering and prompt tuning (few-shot, chain-of-thought, tool/function calling, routing), task adaptation (instruction/SFT, PEFT/LoRA, DPO/RLHF), retrieval-augmented generation, rigorous evaluation and production deployment with appropriate safety, latency, and cost controls.
- 3+ years of hands-on experience in programming in python or a similar scripting language.
- Self-learner, initiator, able to quickly learn new technologies
- Experience in NLP – a significant advantage
- MSc in computer science, math, physics, or related field – advantage
Similar jobs


