Data Scientist / Engineer
Full time
Houston, TX
5 - 50 Years of Experience
Location: Houston, TX

Clearance: DHS Suitability - nice to have

US Citizenship is required

Experience: 5+ years

Remote/Hybrid/Onsite: 100% Onsite work (non-negotiable)

Skill Level Desired: Years of Experience
Senior: 5-10
Expert: 11-15
Subject Matter Expert: 16+

Full Time: Direct Hire 

No 3rd Party Vendors - No exceptions

Note: Available Immediately & Benefits posted below

Introduction

Our client seeks a skilled Data Scientist / Data Engineer to join their team and support the Houston Field Office. In this role, you will analyze large and complex datasets to aid cybercrime investigations, develop ETL processes, and collaborate closely with cybercrime investigators and forensic analysts to enhance investigative outcomes.

Role

-        Analyze large and complex datasets to identify patterns, anomalies, and trends that can aid in cybercrime investigations.

-        Utilize advanced statistical methods and algorithms to extract actionable insights from data.

-        Develop and implement ETL (Extract, Transform, Load) processes to ensure data is efficiently processed and integrated.

-        Design and optimize data indexing and search processes to facilitate efficient data retrieval and discovery.

-        Work closely with cybercrime investigators and forensic analysts to provide data-driven insights that enhance investigative outcomes.

-        Participate in cross-functional teams to integrate data engineering methodologies into ongoing investigations.

-        Create data visualizations and dashboards to present findings in an easily interpretable manner for both technical and non-technical stakeholders.

-        Prepare detailed reports and presentations for internal and external audiences.

-        Contribute to the development and refinement of internal analytics tools and platforms, enhancing the overall effectiveness of data-driven investigations.

-        Develop prototypes and algorithms to support specific investigative needs.

-        Understand and apply Large Language Models (LLMs) to traditional Natural Language Processing (NLP) problems.

-        Utilize LLMs to triage unstructured text and extract relevant information for investigations.

Qualifications

-        Bachelor’s degree in Data Science, Statistics, Computer Science, or a related field.

-        Minimum of 5 years in data analytics or data engineering, preferably with a focus on cybercrime or financial investigations.

-        Proficiency in programming languages such as Python, R, or similar.

-        Proficiency with ETL processes and data integration.

-        Strong knowledge of data indexing, search, and discovery techniques.

-        Proficiency in database systems and SQL.

-        Strong analytical and problem-solving skills.

-        Familiarity with data visualization tools (e.g., Tableau, Power BI).

-        Excellent communication skills, with the ability to explain complex technical concepts to non-technical audiences.

-        Ability to work collaboratively in a team environment.

-        Experience in developing and implementing algorithms and prototypes.

-        Understanding of Large Language Models (LLMs) and their application to NLP problems.

-        Experience with traditional NLP techniques and tools.

-        Experience in cybercrime investigations or a related area.

Salary/Rate:

You know the salary range you are looking for, so let's talk after you fill out the application.

 Note:
  • No 3rd party vendors or candidates
  • US Citizenship Required
  • Active DHS Suitability nice to have

 

 
