Data Scientist/Data Architect
The primary focus will be applying data mining techniques and statistical analysis, and building high quality prediction systems in an unstructured big data environment.
Build and validate predictive models in SAS and R
Understanding of machine learning to uncover customer narratives in streaming data
Create automated anomaly detection systems and track their performance
Present results in a clear manner to upper management
Data Management and Architecture
Integrate data from multiple sources including third party sources.
Enhance data collection procedures to include data for building analytic systems
Processing, cleansing, and verifying the integrity of data used for analysis
Designs and build relational databases for data storage or processing
Develop strategies for warehouse implementation and data acquisition
Skills and Qualifications
Excellent understanding of machine learning techniques and algorithms, especially in the deep
Applied statistical skills and understanding of probability, statistical testing, regression, etc.
Experience with common data science toolkits, such as NumPy, scikit-learn, SAS, R
Experience with TB and PB datasets
Excellent communication skills
Experience with data visualization tools, such as D3.js, GGplot, Matplotlib, tableau
Proficiency in using query languages such as SQL, Hive, Pig
Experience with NoSQL databases, such as Redis/ElasticCache, Cassandra, HBase
Good scripting and programming skills, such as Java, Scala or Python, and Spark
Bachelors degree with an emphasis in Statistics