Data Scientist - Unstructured Data Extraction, Gen AI, Python, NLP, Remote in Europe (h/f)
Job description:
Are you a Data Scientist skilled in data extraction, generative AI, and natural language processing? ---------------------------------------------------------------------------------------------------- emagine are seeking your skills to develop a tool for extracting information from unstructured data. You will use a deep understanding of machine learning algorithms and Python programming to automate the transformation of unstructured data into a structured format. **Key Requirements** * Expertise in data extraction * Expertise in generative AI * Advanced Python programming skills * Confirmed experience in natural language processing * Confirmed experience with Microsoft Azure * Confirmed knowledge of machine learning techniques * Fully fluent and comfortable working in English **Nice to Have** * Experience with cloud computing environments * Familiarity with various data formats and preprocessing techniques **Main Responsibilities** The selected candidate will have the following key responsibilities: * Evaluate and test external tools or build a robust extraction tool for unstructured data * Ensure compatibility with various data types and formats * Incorporate advanced natural language processing techniques * Utilize machine learning algorithms to improve adaptability and efficiency * Provide comprehensive documentation and training materials for end-users **Other Details** This position offers a flexible remote working arrangement. The project duration is expected to span several months, focusing on enhancing data processing mechanisms .