Data Solution Engineer
Chubb
- Philadelphia, PA
- Permanent
- Full-time
- Strong analytical & logical skills
- Utilize the data engineering skills within and outside of the developing Chubb information ecosystem for discovery, analytics and data management
- Work with data scientists, architects, business partners and business analysts to understand requirements, design and build effective solutions
- Understanding of P&C insurance processes, risk data attributes and concepts such as Limits, Exposure bases, Coverages, Packaged, Rating Factors, etc. Experienced in identifying, capturing, profiling & analyzing data from multiple sources (internal and external).
- Skilled in identifying remediation solutions for addressing data issues. Experience in documenting data capture requirements, source to target mappings, data flow diagrams, entity relationships and complex data models. Experienced in translating business needs into systems requirements including rating systems, reports, dashboards and scorecards with minimal or no supervision
- Work with various relational and non-relational data sources with the target being Azure based SQL Data Warehouse & Cosmos DB repositories
- Work closely with the Data Science team to perform complex analytics and data preparation tasks
- Sourcing data from multiple applications, profiling, cleansing and conforming to create master data sets for analytics use
- Experience with Complex Data Parsing (Big Data Parser) and Natural Language Processing (NLP) Transforms on Azure a plus
- Design solutions for managing highly complex business rules within the Azure ecosystem
- Knowledge of Azure, Hadoop 2.0 ecosystems, HDFS, MapReduce, Hive, Pig, Sqoop, Mahout, Spark etc. a must
- Experience with Web Scraping frameworks (Scrappy or Beautiful Soup or similar)
- Extensive experience working with Data APIs (Working with RESTful endpoints and/or SOAP)
- Knowledge of any commercial distribution like Horton Works, Cloudera, MapR, DataBricks etc. a must
- Excellent working knowledge of relational databases, MySQL, Oracle etc.
- Hands on experience of ETL tools like Informatica and SSIS is preferred
- Experience with Complex Data Parsing (Big Data Parser) a must. Should have worked on XML, JSON and other custom Complex Data Parsing formats
- Natural Language Processing (NLP) skills
- Good knowledge of Python libraries like Pandas, NumPy, scikit-learn etc.
- Ready to learn new technologies and tools
- Bachelor's in computer science or related educational background
- Prior experience of Insurance domain a huge plus
- 5+ years of experience