NLP Data Engineer

OriginAI is a new, innovative AI center, the result of a collaboration between Axon Vision, Nvidia and Elbit Systems, established to address challenges at the forefront of AI research at a national scale.
The AI Center, located in Tel Aviv, will engage in research and development in the areas of Vision, Speech and NLP, with the aim of producing world-class, groundbreaking capabilities in these areas.
The center is well funded, with direct access to the most advanced computing resources in the industry.

Job Description

Working closely with the NLP researchers, the NLP Data Engineer will be part of the research and development cycle, facilitating data management, acquisition, cleansing, processing and annotation.


You are a multitasking, resourceful and imaginative data engineer with a can-do approach and a willingness to learn something new every day. You are enthusiastic about understanding human language through text and data. You are ready to jump in at any time and wear many hats because you have the best tech skills.

Your responsibilities include, but are not limited to:
  • Designing, building and testing pipelines and platforms to streamline the development of NLP DL/ML models
  • Acquiring, processing and manipulating textual and other relevant data to produce valuable data sets used for training NLP models
  • Designing and executing the data labeling and data enrichment processes
  • Dealing with extensive data sets and developing efficient ways to store and serve data
  • Managing Big Data repositories and indexes


Requirements

  • B.Sc. in Computer Science, or graduate of a military technology unit
  • 2-3 years’ experience as an NLP Data Engineer
  • 2+ years programming in Python
  • Experience with pandas, spaCy, NLTK and Gensim
  • Experience working with large volumes of data and distributed systems (Hadoop, Spark)
  • Analytical mind with a problem-solving attitude
  • Quick learner, team player, and an independent, motivated individual
  • Able to multitask, prioritize, and manage time efficiently
  • Excellent interpersonal skills


Nice to have

  • Experience with machine learning or deep learning frameworks such as scikit-learn, PyTorch or TensorFlow
  • Understanding of NLP techniques for text representation, semantic extraction, data structures and modeling
  • Experience with the ELK stack and NoSQL Databases
  • Previous experience with other AI-based products (voice, image)
  • Knowledge of Apache Airflow or other ETL frameworks
  • Knowledge of scripting and automation (Jenkins, Groovy)

To apply for this job please visit
