Lead Data Warehouse Developer

athenahealth, Inc.
Austin, Texas
Apply Now
  • Job Type
    Employee

Lead Data Warehouse Developer(athenahealth, Inc.; Austin, TX): Lead Data Warehouse Developer will work on Epocrates; join team of entrepreneurs, architects, engineers, UXers & product ppl; be vital link b/w IT roadmap, capacity & bus. partners’ reqs; support & ensure successful execution of tech. analysis, building, testing & deployment tasks to support key bus. initiatives; apply firm grasp of data warehouse methodologies & strong background in data engineering from eclectic array of data sources to take on design, architecture & delivery of critical info. by working w/ Product Leads & Analysts to analyze bus. reqs, design & develop Data/BI solutions to ingest, integrate & provision structured & semi-structured data; be responsible for modernization & growth of data-platform w/ latest tools in Predictive Analytics, Machine/Deep Learning, Context Personalization & real-time data ingestion & delivery; liaise w/ Product Leads to understand product reqs & strategize how to realize product goals w/ Agile SCRUM methodologies; assist & train BI Analysts to develop SAS Statistical Analysis code for performing complex data tasks & to use tools like Tableau Visualization suite; engage w/ Product & Pharma/Medical teams to ensure compliance w/ regs, incl. HIPAA, GDPR & CA Consumer Privacy Act by writing advance data-scrubbing routines & masking methodologies to ensure data-privacy for sensitive healthcare data; design, develop & implement complex ETL pipelines w/ tools like Informatica PowerCenter, Informatica Intelligent Cloud Services, Data Quality & Axon; develop & optimize ETL data routines written w/ mixture of Perl Scripts, Shell Scripts, Python, Java & Base SAS 7.1 code; modernize data pipelines by designing & creating real-time ingestion sols. w/ tools like Apache Kafka, Apache Storm, Cassandra; write modern containerized apps w/ platforms like Docker to deploy on IaaS infrastructure or Epocrates’ Amazon EC2 & ECS instances; maintain & modernize large SQL code base running on distributed Oracle 11g, 12c, PostgreSQL & Snowflake DBs; optimize & manage 100s of TBs worth of Epocrates Medical, Drug & app data stored in over 500 DB tables across many different DBs; perform continuous DB performance optimizations w/ various tools (Oracle Automatic Workload Repository, Automatic DB Diagnostic Monitor) by coming up w/ new DB Profiling schemes, automatic statistics, indexing, time-based partitioning & DB sharing schemes; use data-modelling tools like Erwin & SAP Power Designer to model complex Physical & Logical data-models; maintain documentation & source control of data-models & ER relationships w/ tools like SVN, Perforce & BitBucket repositories; lead Data Systems SCRUM team implementing all planned sols. & production support w/ iterative Agile methodology; actively plan & manage software-devpmt tasks w/ Agile ceremonies like Bi-weekly Sprint Reviews, Retrospectives, Sprint Grooming & Planning; track performance mgmt w/ SCRUM Burn Down Charts & Agile Boards; build Machine Learning (ML) models w/ quantitative analysis techniques (predictive modeling, segmentation, optimization, clustering, regression); build scalable, highly optimized enterprise reporting sols. w/ Tableau & Power BI running out of Snowflake datalakes; design new DataMarts w/ Command & Query Responsibility Segregation (CQRS) design pattern w/ Snowflake, AWS EC2 & Apache tools (Spark); build data driven ML models (based on K-Means Clustering, Expectation–Maximization Clustering, Mean-Shift Clustering) to translate data into intel. & help transform clinical decision support & other strategic bus. problems in healthcare (patient engagement, drug interactions); use Deep Learning frameworks (MXNet, Tensorflow, Theano & Keras) to help build various ML models; design & apply stat. techniques to evaluate & monitor outcomes of automated clinical decision support at scale w/ ML; & engage w/ Sales, Marketing & Product Outreach Leads to design scalable data sols. w/ ML to help make better decisions.

 

Minimum reqs: Masters in CS or related + 3yrs demonstrated exp working w/ TB-scale DBs/data-warehousing solutions.

 

Must have: Demonstrated exp. w/: writing & managing complex ETL integration scripts in various platforms (Windows, Linux, Mac OSX); at least one BI tool, such as SAS, Power BI, or Tableau; Snowflake or other data lake solutions tool; AWS technologies/tools such as EC2, ECS, SNS, SQS, or Lambda; open-source, real-time, streaming data-pipelines; ML technologies or frameworks, such as Keras, scikit-learn, or Tensorflow; & as data-developer using programming languages, such as C#, Java, Python, Scala, SQL scripting, or JavaScript based frameworks. (Unless otherwise indicated, athenahealth, Inc. is seeking ability in skills listed above w/ no specific yrs of exp. req’d. All exp. can be gained concurrently.)

 

Apply online at «https://www.athenahealth.com/careers»or send resume: Amanda Santamour, GMIMathenahealth, 311 Arsenal St., Watertown, MA 02472. Ref: 00026711. An EOE.


#LI-DNI

Categories

Share this job:

Lead Data Warehouse Developer

athenahealth, Inc.
Austin, Texas

Join us to start saving your Favorite Jobs!

Sign In Create Account