    TMR, Inc. is looking for a Data Scientist to support a client in Arlington, VA. The position requires the ability to obtain a DHS Public Trust.


    Key Responsibilities

    • Develop an understanding of the customer’s data environment through data profiling and statistical analyses
    • Execute complex SQL queries against large Oracle tables efficiently. (Note: Advanced command of SQL is important, beyond simple PROC SQL commands in SAS, and ideally includes experience with tools such as Toad or Oracle SQL Developer.)
    • Obtain, scrub, explore, model and interpret data currently stored in Oracle databases – using SQL and other data mining tools
    • Provide accuracy and biometric sample quality assessments based on machine learning and statistical analyses


    Mandatory Requirements

    • Bachelor's degree in statistics or mathematics and a minimum of 10 years of experience, or equivalent, in the following:
      • Developing predictive models of accuracy using large data sets for a high-transaction-volume environment
      • Evaluating and measuring performance of models
      • Common statistical modeling techniques (e.g., linear regression, logistic regression, decision trees)
      • Understanding of and/or prior experience related to calculating False Acceptance Rate (FAR)/False Match Rate (FMR), False Rejection Rate (FRR)/False No-Match Rate (FNMR), True Acceptance Rate (TAR), and False Alarm Rate
      • Conceptual understanding of and/or prior experience related to data profiling, fuzzy matching, entity resolution, and signal detection theory (specifically with respect to SD theory: designing and improving upon systems that monitor, minimize, and balance false positive and false negative outcomes)
      • Experience related to biometric performance using Receiver Operating Characteristic (ROC), Detection Error Tradeoff (DET), Cumulative Match Characteristic (CMC) curves, and Identification and Detection Rate curves
      • Proficient in at least one of the following programming languages:
        • R
        • MATLAB
        • Julia
        • Java
        • Scala
      • Experience with scripting languages for preprocessing and statistical analysis (e.g., Python with pandas)
      • Proficient in at least one query language (e.g. SQL or HQL)


    • Understanding of big data ecosystems (e.g., Hadoop or Spark)

