The vacancy is detailed in tasks and tech stack but lacks compensation clarity and company information.
Job description
Looking for an Analyst/Developer specializing in NLP with strong SQL and Python skills. Remote position available.
Responsibilities
• Confident knowledge of SQL and experience with popular DBMS or distributed data storage.
• Proficient in Python and main stack for data analysis and visualization: pandas, numpy, polars, matplotlib, seaborn, altair.
• Understanding of basic NLP concepts and desire to develop in this direction.
• Experience in developing ETL and ELT pipelines.
• Basic knowledge in Data Science is a big plus.
• Work with data from HDFS and S3, from databases (GreenPlum, OracleDB, PostgreSQL), as well as from file shares and network drives.
• Prepare data visualizations on Superset and Streamlit.
• Develop data preparation pipelines for training and testing models.
• Analyze data, build and validate hypotheses using Python (pandas, polars) and SQL.
• Handle tasks related to labeling unstructured data: from designing the labeling process to validating results.
• Analyze the performance of existing GenAI/NLP services.
Requirements
• Strong knowledge of SQL and experience with popular DBMS or distributed data storage.
• Proficiency in Python and main stack for data analysis and visualization: pandas, numpy, polars, matplotlib, seaborn, altair.
• Understanding of basic NLP concepts.
• Experience in developing ETL and ELT pipelines.
• Basic knowledge in Data Science is a plus.
• Experience with data from HDFS and S3, databases (GreenPlum, OracleDB, PostgreSQL), file shares, and network drives.
Conditions
• Remote work opportunity.
• Special offer available.