Job Information
Norstella Data Architect in Remote, India
Data Architect
Company: Norstella
Location: Remote, India
Date Posted: Dec 5, 2024
Employment Type: Full Time
Job ID: R-717
Description
About Norstella
At Norstella, our mission is simple: to help our clients bring life-saving therapies to market quicker—and help patients in need.
Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle —and get the right treatments to the right patients at the right time.
Each organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) delivers must-have answers for critical strategic and commercial decision-making. Together, via our market-leading brands, we help our clients:
Citeline – accelerate the drug development cycle
Evaluate – bring the right drugs to market
MMIT – identify barrier to patient access
Panalgo – turn data into insight faster
The Dedham Group – think strategically for specialty therapeutics
By combining the efforts of each organization under Norstella, we can offer an even wider breadth of expertise, cutting-edge data solutions and expert advisory services alongside advanced technologies such as real-world data, machine learning and predictive analytics.
As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.
Job Description
Have you wondered how life saving drugs and therapies are created, tested, marketed and
made available to patients in need? Have you wondered how clinical trials are conducted at
a global scale? How governments and health authorities regulate various organizations
participating in this marketplace? Have you wondered how those companies and insurance
providers price a certain drug, and how a care provider determines the right treatment for
a given patient? If yes, Norstella could the next step in your career.
We are looking for a Data Engineer with a strong background in cloud data warehousing,
data pipelines, and ETL development. The ideal candidate will have extensive experience
with AWS services, Python, and advanced SQL, coupled with a solid understanding of data
modeling and ETL testing. The role requires a candidate who is proactive, detail-oriented,
and capable of leading projects within a collaborative team environment.
Key Requirements:
Cloud Data Architecture Design:
• Design and implement scalable, high-performance data models and architectures
using cloud data warehousing concepts.
• Develop and maintain data models (including Snowflake and Star Schema) for both
structured and unstructured data, ensuring optimal performance and reliability
across AWS services (e.g., S3, Redshift, Glue, and Lambda).
Data Pipeline and ETL Development:
• Build and manage data pipelines to ensure efficient data ingestion, processing, and
integration, utilizing tools like AWS Glue, Airflow, and Pyspark.
• Implement ETL processes to transform and load data from various sources into
Snowflake, Redshift, PostgreSQL, and other platforms, ensuring data completeness
and quality.
Advanced SQL and RDBMS Management:
• Leverage advanced SQL (including joins, subqueries, CTEs) and RDBMS concepts to
develop and optimize complex queries, with a preference for RDS SQL Server.
• Manage AWS RDS instances, specifically PostgreSQL, ensuring robust data storage
and retrieval processes.
Collaboration with Data Science Team:
• Work closely with Data Scientists to understand their data needs, ensuring data
availability and quality for real-world data (RWD) analysis and modeling.
• Provide Python and Pyspark-based data support, troubleshooting, and performance
tuning for data science projects.
Large Data Set Management:
• Handle large data sets with a focus on performance optimization, including
implementing strategies for data partitioning, indexing, and caching within AWS
ecosystems.
• Optimize the querying of large data sets to enhance performance and ensure
efficient data processing.
Performance Tuning and ETL Testing:
• Monitor and optimize data systems for performance, including query optimization,
resource management, and AWS DevOps CI/CD pipelines.
• Perform ETL testing to validate data completeness and quality across various data
feeds, resolving any bottlenecks in data processing and retrieval.
Data Delivery and Governance Ownership:
• Take ownership of data delivery processes, ensuring data is accurate, timely, and
accessible, while establishing and maintaining robust data governance policies and
procedures.
• Ensure data infrastructure is scalable, cost-effective, and aligns with industry best
practices, particularly in the life sciences/pharma domain.
Life Science Data Expertise:
• Apply deep knowledge of life science data and industry-specific requirements to
inform data architecture and modeling decisions, ensuring compliance with relevant
regulations and standards.
• Demonstrate strong leadership and a positive attitude, embodying Norstella's
principles in collaboration and project execution.
Required Skills and Qualifications:
• Cloud Data Warehousing Concepts: Strong understanding of cloud data
warehousing architectures and best practices.
• Data Pipelines/ETL Development: Proven experience in designing and implementing
data pipelines and ETL processes.
• RDS – Postgres: Hands-on experience with AWS RDS, specifically Postgres.
• Python & Pyspark: Proficiency in Python and Pyspark for data manipulation and
transformation.
• AWS Services: Experience with AWS ECS, Lambda, API Gateway, S3, RDS, Glue, and
Airflow.
• RDBMS & Advanced SQL: Expertise in RDBMS and advanced SQL, including joins,
subqueries, CTEs, and complex query writing, with a preference for RDS SQL Server.
• Data Modeling: Understanding of data modeling concepts, including Snowflake and
Star Schema.
• ETL Testing: Experience with ETL testing, focusing on data completeness and quality.
• AWS DevOps CI/CD: Experience with AWS DevOps tools and CI/CD pipelines.
• Life Sciences/Pharma Domain Knowledge: Familiarity with the life sciences or
pharmaceutical domain.
• Soft Skills: Strong leadership attitude, aligns with Norstella principles, and exhibits a
positive and collaborative work attitude.
Education: Minimum bachelor’s degree in computer science and engineering or related
field of study, or equivalent experience. 8+ years of experience as a Data Architect or in a
similar role, with demonstrated expertise in the required skills.
The guiding principles for success at Norstella
01: Bold, Passionate, Mission-First
We have a lofty mission to Smooth Access to Life Saving Therapies and we will get there by being bold and passionate about the mission and our clients. Our clients and the mission in what we are trying to accomplish must be in the forefront of our minds in everything we do.
02: Integrity, Truth, Reality
We make promises that we can keep, and goals that push us to new heights. Our integrity offers us the opportunity to learn and improve by being honest about what works and what doesn’t. By being true to the data and producing realistic metrics, we are able to create plans and resources to achieve our goals.
03: Kindness, Empathy, Grace
We will empathize with everyone's situation, provide positive and constructive feedback with kindness, and accept opportunities for improvement with grace and gratitude. We use this principle across the organization to collaborate and build lines of open communication.
04: Resilience, Mettle, Perseverance
We will persevere – even in difficult and challenging situations. Our ability to recover from missteps and failures in a positive way will help us to be successful in our mission.
05: Humility, Gratitude, Learning
We will be true learners by showing humility and gratitude in our work. We recognize that the smartest person in the room is the one who is always listening, learning, and willing to shift their thinking.
Benefits
Health Insurance
Provident Fund
Life Insurance
Reimbursement of Certification Expenses
Gratuity
24x7 Health Desk
Norstella is an equal opportunities employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value people’s differences, to help everyone achieve more at work as well as in their personal lives so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individual’s abilities, skills, performance and behavior and our business requirements. Norstella operates a zero tolerance policy to any form of discrimination, abuse or harassment.
Sometimes the best opportunities are hidden by self-doubt. We disqualify ourselves before we have the opportunity to be considered. Regardless of where you came from, how you identify, or the path that led you here- you are welcome. If you read this job description and feel passion and excitement, we’re just as excited about you.
Norstella is an equal opportunity employer. All job applicants will receive equal treatment regardless of race, creed, color, religion, alienage or national origin, ancestry, citizenship status, age, physical or mental disability or handicap, medical condition, sex (including pregnancy and pregnancy-related conditions), marital or domestic partner status, military or veteran status, gender, gender identity or expression, sexual orientation, genetic information, reproductive health decision making, or any other protected characteristic as established by federal, state, or local law.