1

Data Scientist Nlp Jobs (NOW HIRING)

We're hiring a Data Scientist focused on natural language processing to build models that turn ... Design and train NLP models for tasks like classification, entity extraction, retrieval ...

Sr Data Scientist GenAI

Dallas, TX ยท On-site +1

$150K - $210K/yr

Opportunity for advancement Sr Data Scientist (NLP / LLM / Generative AI) Location: Dallas, TX Roles & Responsibilities : - Design, build, fine-tune, and deploy LLMs, transformer-based NLP models ...

Sr Data Scientist GenAI

Dallas, TX ยท On-site +1

$150K - $210K/yr

Opportunity for advancement Sr Data Scientist (NLP / LLM / Generative AI) Location: Dallas, TX Roles & Responsibilities : - Design, build, fine-tune, and deploy LLMs, transformer-based NLP models ...

next page

Showing results 1-20

Data Scientist NLP information

See salary details

$37.5K

$122.7K

$196.5K

How much do data scientist nlp jobs pay per year?

As of Jun 8, 2026, the average yearly pay for data scientist nlp in the United States is $122,738.00, according to ZipRecruiter salary data. Most workers in this role earn between $98,500.00 and $136,000.00 per year, depending on experience, location, and employer.

What are the typical projects or challenges faced by Data Scientist NLP professionals in their daily work?

Data Scientist NLP professionals often tackle projects involving text classification, sentiment analysis, entity recognition, machine translation, and language modeling using large datasets. They regularly address challenges such as cleaning and preprocessing noisy text data, tuning models for accuracy, and keeping up with rapidly evolving NLP techniques. Collaboration with data engineers, product managers, and subject matter experts is a key part of the role to ensure the developed models solve real business problems. This dynamic environment offers numerous opportunities to innovate and advance technically, especially as NLP technologies continue to mature.

What is a Data Scientist NLP job?

A Data Scientist specializing in Natural Language Processing (NLP) focuses on analyzing and interpreting human language using machine learning and computational techniques. They work with large text datasets to build models for tasks like sentiment analysis, text classification, language translation, and chatbot development. Their role involves data preprocessing, model training, evaluation, and deployment in real-world applications. Proficiency in NLP libraries such as spaCy, NLTK, and transformers, along with programming skills in Python, is essential.

What are the key skills and qualifications needed to thrive in the Data Scientist Nlp position, and why are they important?

To thrive as a Data Scientist NLP, you need a strong background in statistics, machine learning, and natural language processing, often supported by an advanced degree in computer science, data science, or a related field. Experience with languages and tools such as Python, TensorFlow, PyTorch, spaCy, and relevant NLP libraries, along with familiarity with cloud platforms, is highly valuable. Excellent problem-solving, communication skills, and the ability to distill complex information for non-technical audiences make a candidate stand out. These competencies are crucial to building effective language models and delivering actionable insights from large volumes of unstructured data in real-world business contexts.

More about Data Scientist NLP jobs
What cities are hiring for Data Scientist Nlp jobs? Cities with the most Data Scientist Nlp job openings:
What are the most commonly searched types of Data Scientist Nlp jobs? The most popular types of Data Scientist Nlp jobs are:
What states have the most Data Scientist Nlp jobs? States with the most job openings for Data Scientist Nlp jobs include:

Data Scientist - NLP & AI

VeeRteq Solutions Inc.

Houston, TX โ€ข On-site

Other

Posted 22 days ago


Job description

Data Scientist โ€“ NLP & AI

Experience:ย 7+ YearsLocation:ย Houston, TX โ€“ At least two days in office/weekWhat is in it for you?

As a Data Scientist โ€“ NLP & AI, you will be part of an agile team focused on building intelligent healthcare solutions by developing advanced NLP modules, integrating LLMs and agentic workflows, and leveraging AWS big data technologies to enhance clinical data processing and usability.

Responsibilities:
  • Analyze and process clinical textual data using AI-powered NLP techniques and advanced machine learning models.

  • Modify and improve current workflows by incorporating cutting-edge machine learning and deep learning algorithms, including leveraging large language models (LLMs) and tools like LangGraph for complex AI agentic workflows in healthcare contexts.

  • Develop NLP modules within the NLP development team using programming or scripting languages such as Python.

  • Conduct pre-processing and quality analysis for textual data inputs and validate performance of NLP outputs.

  • Create systematic testing procedures, error-checking mechanisms, and user manuals for NLP modules.

  • Build infrastructure for optimal extraction, transformation, and loading of data from diverse sources including MCP servers, using SQL and AWS big data frameworks such as EMR and Spark/pySpark.

  • Collaborate with Engineering teams to ensure scalable and efficient data workflows using SQL and AWS big data technologies.

  • Apply working knowledge of AWS services, particularly AWS Bedrock, to develop generative AI applications.

  • Utilize relational databases such as PostgreSQL or MySQL for data storage and retrieval in NLP and AI workflows.

Skills:Mandatory skills
  • Proficiency in Python and scripting languages for NLP and machine learning development.

  • Strong understanding of clinical NLP techniques and experience with machine learning and deep learning models.

  • Hands-on experience with large language models and agentic workflow tools such as LangGraph.

  • Expertise in SQL and big data technologies including AWS EMR and Spark/pySpark.

  • Practical knowledge of AWS services, especially AWS Bedrock for generative AI applications.

  • Experience with relational databases such as PostgreSQL or MySQL.

Good to have skills
  • Familiarity with generative AI applications in healthcare and related use cases.

  • Understanding of healthcare data standards and terminologies such as HL7, FHIR, and CCDA.

  • Experience in creating detailed documentation, user manuals, and technical specifications.

  • Background in automated testing and validation frameworks for NLP outputs.

  • Ability to collaborate effectively with cross-functional teams including engineering and products.

  • Exposure to LangChain or similar frameworks for building intelligent agent workflows.

Educational Qualifications:

Engineering Degree โ€“ BE/ME/BTech/MTech/BSc/MSc.
Technical certification in multiple technologies is desirable.