Job summary:


Title:
Big Data Consultant

Location:
Columbus, OH, United States

Length and terms:
Long term - W2 or C2C


Position created on 12/03/2021 07:52 pm

Job description:


*** Webcam interview *** Long term contract *** Initially remote due to Covid, then onsite; must pick up laptop in person

Updates From Manager:

  • Experience in analysis, design, development, support, and enhancement in a data warehouse environment with Cloudera Big Data technologies, including a minimum of 8 years of experience in data analysis, data profiling, data modeling, data cleansing, and data quality analysis across various layers, using database queries on both Oracle and Big Data platforms.
  • Experience (minimum of 8 years) working with the Erwin data modeling tool, Hive/Impala queries, Unix commands, and shell scripting.

SCOPE OF WORK summary:

The Senior Database Architect will be responsible for Medicaid Enterprise data warehouse design, development, implementation, migration, maintenance, and operational activities. The candidate will work closely with the Data Governance and Analytics team and will be one of the key technical resources for various Enterprise data warehouse projects, building critical data marts and handling data ingestion to the Big Data platform for data analytics and exchange with State and Medicaid partners. This position is a member of Medicaid ITS and OST, working closely with the Business Intelligence & Data Analytics team.
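For illustration only, here is a minimal PySpark sketch of the Oracle-to-Big-Data ingestion pattern this scope describes; the connection URL, credentials, schema, table, and path names are hypothetical assumptions rather than details from the posting:

    # Hypothetical sketch only: the Oracle-to-Big-Data ingestion pattern
    # described above. URL, credentials, table, and paths are assumptions.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .appName("oracle_to_hdfs_ingest")   # hypothetical job name
             .enableHiveSupport()
             .getOrCreate())

    # Pull a source table from Oracle over JDBC (the Oracle JDBC driver jar
    # must be on the Spark classpath).
    src = (spark.read.format("jdbc")
           .option("url", "jdbc:oracle:thin:@//dbhost:1521/ORCLPDB")  # assumed host/service
           .option("dbtable", "MEDICAID.CLAIMS")                      # assumed schema.table
           .option("user", "etl_user")                                # assumed credentials
           .option("password", "***")
           .option("fetchsize", "10000")
           .load())

    # Land the extract on HDFS as Parquet for downstream Hive/Impala access.
    src.write.mode("overwrite").parquet("/data/landing/claims/full/")  # assumed path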

Detailed Day-To-Day Job Duties to be performed: 

  • Participate in team activities, design discussions, stand-up meetings, and planning reviews with the team.
  • Perform data analysis, data profiling, data cleansing, and data quality analysis in various layers using database queries on both Oracle and Big Data platforms.
  • Elicit, analyze, and document functional and non-functional requirements.
  • Document business requirements, meeting minutes, and key decisions/actions.
  • Lead client meetings and sessions with data-driven analysis to clarify requirements and design decisions.
  • Perform data gap and impact analysis due to new data addition and existing data changes for any new business requirements and enhancements.
  • Follow the organization's design standards document; create data mapping specification documents, pseudocode for the development team(s), and design documents.
  • Create logical & physical data models.
  • Review and understand existing business logic used in Oracle and Hadoop ETL platforms to verify it against business user needs.
  • Review PySpark programs that are used to ingest historical and incremental data (see the sketch after this list).
  • Review Sqoop scripts that ingest historical data from the Oracle database to Hadoop IOP, as well as Hive table and Impala view creation scripts for dimension tables.
  • Assist Business Analysts in creating test plans, designing test scenarios, writing SQL scripts (preferably Oracle and Hadoop), preparing test or mockup data, and executing the test scripts.
  • Validate and record test results; log and research defects.
  • Analyze production data issues, report problems, and find solutions to fix any issues.
  • Create incidents and tickets to fix production issues; create Support Requests to deploy the development team's code to the UAT environment.
  • Participate in meetings to continuously upgrade functional and technical expertise.
  • Establish priorities & follow through on projects, paying close attention to detail with minimal supervision.
  • Create and present project plan, project status and other dashboards as necessary.
  • Perform other duties as assigned.
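As referenced above, a minimal sketch of the kind of PySpark ingestion program these review duties concern, handling a historical load and an incremental load; every table, path, and column name here (claims, load_ts, dw.audit_load_log) is an illustrative assumption:

    # Hypothetical sketch of a PySpark program of the kind the review duties
    # above describe. Tables, paths, and the watermark column are assumptions.
    from pyspark.sql import SparkSession, functions as F

    spark = (SparkSession.builder
             .appName("claims_incremental_ingest")   # hypothetical job name
             .enableHiveSupport()
             .getOrCreate())

    # Historical (one-time) load: the full extract already landed on HDFS.
    historical = spark.read.parquet("/data/landing/claims/full/")   # assumed path

    # Incremental load: keep only rows newer than the last high-water mark,
    # read here from an assumed audit table.
    last_ts = (spark.table("dw.audit_load_log")
               .agg(F.max("load_ts"))
               .first()[0])
    incremental = (spark.read.parquet("/data/landing/claims/delta/")  # assumed path
                   .where(F.col("load_ts") > F.lit(last_ts)))

    # Combine and publish to the warehouse layer as a Parquet-backed Hive table.
    (historical.unionByName(incremental)
     .write.mode("overwrite")
     .format("parquet")
     .saveAsTable("dw.claims"))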

REQUIRED Skill Sets: 

  • 8+ years of data analysis/architecture experience using Waterfall and Agile methodologies across various domains (healthcare preferred) in a data warehouse environment.
  • Good knowledge of relational databases, the Hadoop big data platform and tools, and data vault and dimensional model design.
  • Strong SQL experience (minimum of 8 years, preferably Oracle, Hive, and Impala) creating DDL and DML in Oracle, Hive, and Impala (see the sketch after this list).
  • Experience in analysis, design, development, support, and enhancement in a data warehouse environment with Cloudera Big Data technologies (with a minimum of 8-9 years of experience in Hadoop, MapReduce, Sqoop, PySpark, Spark, HDFS, Hive, Impala, StreamSets, Kudu, Oozie, Hue, Kafka, Yarn, Python, Flume, Zookeeper, Sentry, Cloudera Navigator), along with Informatica.
  • Experience (minimum of 8 years) working with Sqoop scripts, PySpark programs, HDFS commands, HDFS file formats (Parquet, Avro, ORC, etc.), StreamSets pipelines, job scheduling, Hive/Impala queries, Unix commands, and shell scripting.
  • Experience migrating data from a relational database (preferably Oracle) to the Hadoop big data platform is a plus.
  • Experience eliciting, analyzing and documenting functional and non-functional requirements.
  • Ability to document business, functional and non-functional requirements, meeting minutes, and key decisions/actions.
  • Experience in identifying data anomalies.
  • Experience building data sets and familiarity with PHI and PII data.
  • Ability to establish priorities & follow through on projects, paying close attention to detail with minimal supervision.
  • Effective communication, presentation, & organizational skills.
  • Good experience working with Visio, Excel, PowerPoint, Word, etc.
  • Effective team player in a fast-paced, quick-delivery environment.
  • Required Education: BS/BA degree or combination of education & experience.
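As a hedged illustration of the DDL/DML skill above, the following sketch issues Hive DDL and DML through PySpark (keeping one language across the examples); the dw/stg databases, tables, and columns are assumptions, and the Impala view step is noted only in a comment:

    # Hypothetical sketch of the Hive DDL/DML named in the SQL requirement,
    # issued through PySpark. Databases, tables, and columns are assumptions.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .appName("dim_member_ddl")   # hypothetical job name
             .enableHiveSupport()
             .getOrCreate())

    # DDL: a Parquet-backed dimension table (Parquet being one of the HDFS
    # file formats listed above).
    spark.sql("""
        CREATE TABLE IF NOT EXISTS dw.dim_member (
            member_sk BIGINT,
            member_id STRING,
            eff_date  DATE,
            end_date  DATE
        )
        STORED AS PARQUET
    """)

    # DML: (re)load the dimension from an assumed staging table.
    spark.sql("""
        INSERT OVERWRITE TABLE dw.dim_member
        SELECT member_sk, member_id, eff_date, end_date
        FROM stg.member
    """)

    # A matching Impala view would typically be created in impala-shell, e.g.
    # CREATE VIEW dw.vw_dim_member AS SELECT * FROM dw.dim_member;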

DESIRED Skill Sets: 

  • Demonstrated effective leadership, analytical, and problem-solving skills.
  • Excellent written and oral communication skills with technical and business teams.
  • Ability to work independently as well as part of a team.
  • Stays abreast of current technologies in the assigned IT area.
  • Establishes facts and draws valid conclusions.
  • Recognizes patterns and opportunities for improvement throughout the entire organization.
  • Ability to discern critical from minor problems and innovate new solutions.

Required skills:

  • 8 years of experience in analysis, design, development, support, and enhancement in a data warehouse environment with Cloudera Big Data technologies.
  • Experience (8+ years) working with the Erwin data modeling tool, Hive/Impala queries, Unix commands, and shell scripting.
  • Experience using database queries on both Oracle and Big Data platforms.
  • Good knowledge of relational databases, the Hadoop big data platform and tools, and data vault and dimensional model design.
  • 8 years of strong SQL experience (preferably Oracle, Hive, and Impala) creating DDL and DML in Oracle, Hive, and Impala.

Contact the recruiter working on this position:



The recruiter working on this position is Chinmayee Patro (Ravi Team).
His/her contact number is +1 (202) 697-9490.
His/her contact email is chunlipatro@msysinc.com.

Our recruiters will be more than happy to help you get this contract.