Skip to content

Senior Data Scientist with NLP

Remote
  • Wrocław, Dolnośląskie, Poland

Job description

You’ll be working on diverse machine learning projects for local and international companies as well as in academic research. This will involve different phases of the end-to-end delivery – direct contact with the client, business analysis of the problem, coming up with an appropriate solution, implementation and moving it to the production environment. 

Job requirements

Primary qualifications:

  • Master of Science in a technical field: computer science, mathematics, physics, and similar

  • experience and practical knowledge of supervised and unsupervised machine learning / deep learning environments with practical experience and understanding of neural networks and natural language processing models, and related applications.

  • practical working experience in analyzing different types of data forms – unstructured, semi-structured, and structured data.

  • experience with natural language processing methodologies, including BERT, and have deep knowledge of comparing outcomes from different environments.

  • practical understanding of the transformers, querying from LLMs, and combining Generative AI (not Open AI GPT) with Traditional Predictive / ML-based AI

  • be able to construct, train an unstructured data model environment to perform sentiment analysis, named entity and object recognition, queries, classification, info extraction, etc.; molding with predictive AI to achieve our goals of extracting intelligence at speed from different data types.

  • Contribute and maintain whilst overseeing a high-quality code base supported with tests that will provide a high level of functional production environment coverage and supporting related work such as load testing, unit testing, integration testing etc.

  • Utilize programming languages and environments – AWS (high focus), and Python, Container Orchestration services (Docker and Kubernetes) in line with our current tech stack.

  • commercial experience with Python and its data science-oriented packages – e.g. NumPy, Pandas, scikit-learn, Jupyter

  • knowledge of basic statistical and probabilistic concepts (e.g. random variable, distribution, likelihood)

  • in-depth knowledge of a specific ML area (e.g. computer vision, NLP, reinforcement learning, mathematical modeling, time series, neural networks, gradient boosting)

  • experience in projects using natural language processing or computer vision

  • familiarity with relational databases and SQL

  • experience in using Linux and command line

  • familiarity with Git and Docker

  • highly developed analytical thinking and problem-solving skills

  • ability to work in a team

  • willingness to learn and develop skills

  • communicative English – minimum B2 level


It is great if you have:

  • experience with DevOps / MLOps tools and practices (e.g. Docker, Kubernetes, MLFlow, KubeFlow, DVC)

  • familiarity with a deep learning framework (Tensorflow, PyTorch)

  • experience in using additional data science related libraries (e.g. nltk, opencv, scikit-image, gensim, plotly, seaborn, xgboost, lightgbm)

  • strong general software development skills and knowledge of best practices

  • algorithmic and code optimization skills

  • knowledge of a cloud platform and experience in running cloud-based projects (GCP, AWS, Azure)

Salary:

19 000 - 30 000 PLN + VAT (B2B)

We offer you:

  • working with the newest machine learning technologies

  • budget on self-development per year

  • possibility to contribute to a variety of interesting projects

  • internal workshops

  • personal branding (articles, conference speaker, internal workshop leader)

  • flexible work hours

  • remote work possibility

  • chillout room / free beverages / team & company events

  • friendly atmosphere

  • MultiSport 

  • LuxMed 

or