Lead Data Scientist

SW5 Consulting

New York, NY, USA
Permanent
Hybrid
$150,000 - $210,000/year
PythonOpen- source LLM'sNLP

Opportunity for a Senior Data Scientist (LLM/NLP Focus)

Do you thrive at the intersection of language, logic, and learning systems? A confidential team is seeking a seasoned builder of smart systems — someone who speaks fluent Python and thinks in embeddings.

This is a hands-on role for someone who’s been deep in the weeds of GenAI, LLMs, and NLP — and wants to push boundaries in applied research with real-world impact.

What You’ll Be Doing

  • Crafting custom models for stream and batch pipelines — think GenAI, LLMs, NLP, and ML.
  • Working across ingestion, retrieval, RAG, fine-tuning, and prompt design.
  • Partnering with internal teams to make sure your models don’t just work — they work well.

Who You’ll Be Working With

  • Product thinkers, engineers, and ML folks who care about performance and precision.
  • A collaborative crew that values experimentation and iteration.

What You Bring

  • Advanced degree in a technical field (CompSci, Stats, Linguistics, etc.).
  • Experience with tuning open-source LLMs and staying current with open-source trends.
  • 8+ years wrangling structured/unstructured data for insights and automation.
  • 3+ years hands-on with Python and libraries like Hugging Face, PyTorch, TensorFlow.
  • Deep experience with transformer-based NLP models and semantic search.

Logistics & Perks

  • Hybrid setup — you’ll need to be able to drop into a workspace in the Northeast US (NY/NJ area).
  • Compensation: $150k–$210k base + bonus + benefits + sign-on