We Are Looking For
As a Data Scientist, you will drive the clustering of adversarial prompts and build automation for GenAI red teaming and sandboxing across models and providers. We're looking for a hands-on technologist with deep experience in data clustering, Big Data, Machine Learning, and predictive modeling.
Key Responsibilities
* Manage and analyze prompt data from multiple sources; clean, curate, normalize, and tag it for analysis.
* Analyze large volumes of structured and unstructured data to uncover trends, clusters, and anomalies.
* Develop ML models and predictive algorithms to automate red teaming (prompt generation, mutation, clustering, prioritization, labeling).
* Use statistical techniques and experiments to validate findings and ensure accuracy and reproducibility.
* Build sandboxes and safe environments for testing models.
* Evaluate prompts across GenAI models and endpoints.
* Excellent communication, documentation, and cross-team collaboration skills.
About ActiveFence:
ActiveFence is the leading provider of security and safety solutions for online experiences, safeguarding more than 3 billion users, top foundation models, and the world's largest enterprises and tech platforms every day. As a trusted ally to major technology firms and Fortune 500 brands that build user-generated and GenAI products, ActiveFence empowers security, AI, and policy teams with low-latency Real-Time Guardrails and a continuous Red Teaming program that pressure-tests systems with adversarial prompts and emerging threat techniques. Powered by deep threat intelligence, unmatched harmful-content detection, and coverage of 117+ languages, ActiveFence enables organizations to deliver engaging and trustworthy experiences at global scale while operating safely and responsibly across all threat landscapes.
Hybrid:
Yes
Requirements:
Must-Have
* 5+ years of programming in Python or R, plus SQL.
* 3+ years of experience with scikit-learn and PyTorch.
* Strong grasp of clustering, embeddings/NLP, and anomaly detection.
* Experience with Cradle, Apache Hadoop, and Spark; experience scaling ETL and feature pipelines.
Nice-to-Have
* M.Sc. in CS/EE/Math or related; Ph.D. is an advantage.
* Data visualization with Tableau, Power BI, or matplotlib.
* Experience with AWS, including Lambda, S3, and EC2.
* Experience running inference on GenAI models via Hugging Face, Bedrock, and Azure.
This position is open to all candidates.