GitHub - Isaac Arnault

Isaac Arnault, PhD

Ex-tech entrepreneur
Now multi-certified Data & AI engineering and Advanced Analytics senior consultant
Recognized Technical Sales Specialist on 6 solutions:
Snowflake, Cloudera, Databricks, SAP, IBM Watsonx, AWS (AWS Certified Partner)

"Semper Supra"

Fields of expertise
Analytics (financial planning / forecasting, OPEX / CAPEX budgeting, capacity planning), Business Intelligence, Reporting
Data (ETL / ELT, Massive Parallel Processing, RDBMS, Data Lineage, Data Quality, Data visualization)
Artificial Intelligence: Supervised / Unsupervised Learning, Machine Learning, Deep Learning
Infrastructures (On-Premises, Cloud-based, Virtualized), Middlewares, Cloud computing
Commitment to my Clients
I am able to pilot the delivery of high-end IT solutions from an end-to-end perspective on critical (physical, virtualized, hybrid) infrastructures.
Expression of the needs
State of the Art
Project objectives
Solutioning
Capacity planning
Solution pricing/costing
Client's pitch & pre-sales
Build
Delivery
Run / TMA / OMC
Career Milestone
Lead Data Architect on a 4BN € project for a major finance institution in 2021
bp2s logo

Latest certification


Latest cheat-sheets

  • 11.25.2022 Project Management

    ⇨ Project Management cheat-sheet for a job interview / Client facing

    Gist

  • 07.11.2022 Elastic

    ⇨ Elasticsearch cheat-sheet for a job interview / Client facing

    Gist

    Latest scripts

  • 03.08.2024 SAP S/4HANA
    ⇨ Deploy SAP S/4HANA Public Cloud Edition and an RDS (SQL Server) DB on AWS using Terraform (Infrastructure as Code)

    Gist

    Repository

  • 04.26.2024 Cloud (Terraform series)
    ⇨ Deploy a 3 Databricks nodes cluster using Terraform (IaC) on Azure

    Gist

    Repository

  • 10.10.2020 Cloud (Terraform series)
    ⇨ AWS VPC using Terraform and Jenkins integration

    Gist

    Repository

    Git

  • 09.15.2020 Cloud (Terraform series)
    ⇨ Get started with Terraform Cloud using GitHub and AWS

    Gist

    Repository

    Git

  • 02.01.2020 Cloud (AWS series)
    ⇨ PostgreSQL integration & setting up: an effective way

    Gist

    Repository

    Git

  • 11.05.2019 Cloud (AWS series)
    ⇨ Starting with Oracle DB on AWS and Oracle SQL developer remotely

    Gist

    Repository

    Git

  • 10.11.2019 Cloud (AWS series)
    ⇨ Creating a static website on AWS - Hands-on

    Gist

    Repository

    Git

  • 08.08.2019 Cloud (AWS series)
    ⇨ Deploying a serverless web app using an API Gateway and Lambda function on AWS - Hands-on

    Gist

    Repository

    Git

  • 07.27.2019 Cloud (AWS series)
    ⇨ Testing Elastic Load Balancing on AWS

    Gist

    Repository

    Git

  • 07.25.2019 Cloud (AWS series)
    ⇨ Creating Virtual Private Cloud (VPC) on AWS an enabling Flow Logs

    Gist

    Repository

    Git

  • 07.12.2019 Cloud (AWS series)
    ⇨ Deploying a Wordpress site using AWS RDS and free tier EC2 instance

    Gist

    Repository

    Git

  • 07.11.2019 Cloud (AWS series)
    ⇨ Mounting an Elastic File System (EFS) using TLS on two EC2 instances (Linux)

    Gist

    Repository

    Git

  • 07.10.2019 Cloud (AWS series)
    ⇨ Retrieving metadata from a EC2 bootstrapped instance (Linux)

    Gist

    Repository

    Git

  • 07.07.2019 Cloud (AWS series)
    ⇨ Accessing AWS S3 buckets remotely (Linux)

    Gist

    Repository

    Git

  • 07.05.2019 Cloud (AWS series)
    ⇨ Creating a Web Server on AWS using EC2 (Linux)

    Gist

    Repository

    Git


  • DATA ENGINEERING

  • 01.01.2020 Spark Scala
    ⇨ Introducing Machine Learning using Spark-Scala and IntelliJ

    Gist

    Repository

    Git

  • 09.22.2019 Spark Scala
    ⇨ Data engineering using Spark-Scala - Hands-on

    Gist

    Repository

    Git



  • DATA SCIENCE

  • 01.12.2019 Python
    ⇨ Edges detection of an image using Numpy, PIL, Math and Requests packages

    Gist

    Repository

    Git

  • 12.15.2018 Python
    ⇨ Dates estimator using Python

    Gist

    Repository

    Git

  • 12.10.2018 Python R
    ⇨ Data collection and statistics using Python and R

    Gist

    Repository

    Git

  • 12.03.2018 R
    ⇨ Data exploration and visualization using R

    Gist

    Repository

    Git

  • 11.27.2018 Python
    ⇨ Permutations using Python

    Gist

    Repository

    Git

  • Methodology
    My approach as Consultant

    isaac arnault data science



    BIG DATA

  • 11.02.2019 Hadoop
    ⇨ Deploying a Hadoop cluster for Test purposes using AWS EC2, Docker and Cloudera

    Gist

    Repository

    Git

  • 04.01.2019 Big Data
    ⇨ Big Data and Data Science - Tools installation (Linux)

    Gist

    Repository

    Git

  • Big Data bootcamp
    Nov. 2018 - Mar. 2019 | Project defence

    big data architecture 1

    big data architecture 2

    nifi architecture


    Solution architecture
    Bootcamp - December / March 2019

    datalake

    lambda 1

    lambda 2


    Education - IT courses

    Courses certificates

    Teamwork Foundations - Issued by: LinkedIn® - Issued on: DEC 2022

    PMI - Teamwork

    Become a Project Manager - Issued by: LinkedIn® - Issued on: DEC 2022

    PMI - Project Manager

    Project Management - Communication - Issued by: PMI® - Issued on: DEC 2022

    PMI - Communication

    Project Management Simplified - Issued by: PMI® - Issued on: DEC 2022

    PMI - Stakeholders

    Project Management - Managing Project Stakeholders - Issued by: PMI® - Issued on: DEC 2022

    PMI - PM Simplified

    Project Management Foundations : Risk - Issued by: PMI® - Issued on: DEC 2022

    PMI - Risk Management

    Project Management Foundations : Communication - Issued by: PMI® - Issued on: DEC 2022

    PMI - Communication

    Project Management : Leaning Gantt Charts - Issued by: PMI® - Issued on: DEC 2022

    PMI - Gantt

    Project Management Foundations : Teams - Issued by: PMI® - Issued on: DEC 2022

    PMI - Teams

    Project Management Foundations : Budgets - Issued by: PMI® - Issued on: DEC 2022

    PMI - Enhancing your productivity

    Project Management : Enhancing Your Productivity - Issued by: PMI® - Issued on: DEC 2022

    PMI - Enhancing your productivity

    Project Management : Solving Common Project Problems - Issued by: PMI® - Issued on: DEC 2022

    PMI - Solving problems

    Project Management Foundations : Schedules - Issued by: PMI® - Issued on: DEC 2022

    PMI - schedules

    Project Management Foundations : Requirements - Issued by: PMI® - Issued on: DEC 2022

    PMI - requirements

    Project Management Foundations : Ethics - Issued by: PMI® - Issued on: DEC 2022

    PMI - Ethics

    Project Management Foundations - Issued by: PMI® - Issued on: DEC 2022

    PMI

    Project Management Foundations - Issued by: NASBA® - Issued on: DEC 2022

    NASBA

    Project Coordination - Issued by: PMI® - Issued on: DEC 2022

    PMI - PM

    AWS Partner: Sales Accredication (Technical) - Issued by: AWS® - Issued on: NOV 2022

    aws partner - technical sales accredidation

    AWS Partner: Sales Accredication (Business) - Issued by: AWS® - Issued on: NOV 2022

    aws partner - business sales accredidation

    AWS Partner: Cloud Economics Accreditation - Issued by: AWS® - Issued on: NOV 2022

    aws partner - cloud economics

    Customer Service Fundamentals - Issued by: Knowledge Accelerators® - Issued on: NOV 2022

    Z Performance and Pricing

    Z Performance - z/OS Performance Tools and Software Pricing - Issued by: Kyndryl® - Issued on: NOV 2022

    Z Performance and Pricing

    Z Performance - z/OS I/O Performance and Capacity Planning - Issued by: Kyndryl® - Issued on: NOV 2022

    Z Performance and Capacity Planning

    Z Performance - z/OS Workload Manager - Issued by: Kyndryl® - Issued on: NOV 2022

    Z Performance

    DB2 v12 Operations - Issued by: Kyndryl® - Issued on: NOV 2022

    DB2 v12

    Mainframe Performance - Issued by: Kyndryl® - Issued on: NOV 2022

    mainframe performance

    z/VSE Operations - Issued by: Kyndryl® - Issued on: NOV 2022

    z_VSE operations

    Linux Operations - Issued by: Kyndryl® - Issued on: NOV 2022

    linux operations

    Linux Shell programming - Issued by: Kyndryl® - Issued on: NOV 2022

    linux shell

    The Linux File System - Issued by: Kyndryl® - Issued on: OCT 2022

    file storage

    Coaching and Mentoring for Technical Specialists - Issued by: Kyndryl® - Issued on: OCT 2022

    mentoring

    Data Foundation - Issued by: Collibra® - Issued on: OCT 2022

    Collibra

    Administrator Training - CDP Private Cloud Base - Issued by: Cloudera® - Issued on: SEP 2022

    IBM

    ETL and Data Pipelines with Shell, Airflow and Kafka - Issued by: IBM® - Issued on: JULY 2022

    IBM

    Application Development using Microservices and Serverless - Issued by: IBM® - Issued on: JULY 2022

    IBM

    Agile Explorer - Powered by Agile at IBM - Issued by: IBM® - Issued on: JULY 2022

    IBM

    Agile Operations Fundamentals - Issued by: IBM® - Issued on: JULY 2022

    IBM

    Project Management Fundamentals - Issued by: IBM® - Issued on: JULY 2022

    IBM

    Process Controls - Issued by: IBM SkillsBuild® - Issued on: JULY 2022

    SkillsBuild

    Problem Solving - Issued by: IBM SkillsBuild® - Issued on: JULY 2022

    SkillsBuild

    Personality Dynamics - Issued by: IBM SkillsBuild® - Issued on: JULY 2022

    SkillsBuild

    Communication Skills - Issued by: IBM SkillsBuild® - Issued on: JULY 2022

    SkillsBuild

    Deep Learning with Tensorflow - Issued by: CognitiveClass® - Issued on: JULY 2022

    Cognitive Class

    Accelerating Deep Learning with GPUs - Issued by: CognitiveClass® - Issued on: JULY 2022

    Cognitive Class

    Elasticsearch engineer - Issued by: Koenig Solutions® - Issued on: JULY 2022

    Koenig certificate

    Mastering Terraform - Integrating with Jenkins and Ansible - Issued by: Udemy® - Issued on: OCT 2020

    Udemy certificate

    Network Security & Database Vulnerabilities - Issued by: Coursera® IBM® - Issued on: SEP 2020

    Network Security and Database Vulnerabilities

    AWS Certified Solutions Architect - Associate [Latest Exam] - Issued by: Udemy® - Issued on: JUL 2020

    AWS Certified Solution Architect Associate - Exam

    AWS Certified Solutions Architect - Associate - Issued by: Udemy® - Issued on: JUL 2019

    AWS Certified Solutions Architect - Associate

    Spark Fundamentals II - Issued by: CognitiveClass® - Issued on: OCT 2019

    Spark Level II

    Analyzing Big Data in R using Apache Spark - Issued by: CognitiveClass® - Issued on: OCT 2019

    Spark R

    Exploring Spark's GraphX - Issued by: CognitiveClass® - Issued on: OCT 2019

    Spark MLlib

    Spark MLlib - Issued by: CognitiveClass® - Issued on: OCT 2019

    Spark MLlib

    Data Science with Scala - Issued by: CognitiveClass® - Issued on: OCT 2019

    Scala

    Spark Overview for Scala Analytics - Issued by: CognitiveClass® - Issued on: SEP 2019

    Cognitive Class

    Scala 101 - Issued by: CognitiveClass® - Issued on: SEP 2019

    Cognitive Class

    Exam Readiness: AWS Certified Solutions Architect – Associate - Issued by: AWS® - Issued on: SEP 2019

    AWS certified solution architect - associate

    AWS Fundamentals: Going Cloud-Native - Issued by: Coursera® - Issued on: AUG 2019

    AWS fundamentals

    AWS Certified Cloud Practitioner Essentials (Second Edition) - Issued by: AWS® - Issued on: AUG 2019

    AWS cloud practitioner training

    Data Analysis with Python - Issued by: CognitiveClass® - Issued on: DEC 2018

    Cognitive Class

    Moving Data into Hadoop - Issued by: CognitiveClass® - Issued on: DEC 2018

    Cognitive Class

    Python 101 for Data Science - Issued by: CognitiveClass® - Issued on: DEC 2018

    Cognitive Class

    Data Privacy Fundamentals - Issued by: CognitiveClass® - Issued on: NOV 2018

    Cognitive Class

    Digital Analytics & Regression - Issued by: CognitiveClass® - Issued on: NOV 2018

    Cognitive Class

    Data Visualization with R - Issued by: CognitiveClass® - Issued on: NOV 2018

    Cognitive Class

    Predictive Modeling Fundamentals I - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    R 101 - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    Machine Learning with R - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    Data Science Hands-On with Open Source tools - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    Reactive Architecture: Introduction to Reactive Systems - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    SAS® Programming 1: Essentials - Issued by: SAS® - Issued on: OCT 2018

    SAS

    Spark Fundamentals I - Issued by: CognitiveClass® - Issued on: OCT 2018

    apache spark

    MapReduce and Yarn- Issued by: CognitiveClass® - Issued on: OCT 2018

    SCognitive Class

    Hadoop 101 - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class

    Big Data 101 - Issued by: CognitiveClass® - Issued on: OCT 2018

    Cognitive Class


    Skills

    Programming languages & tools
  • Web Dev
    Html
    Css
    Jquery, Javascript
    Ajax
    XML
    CMS: Joomla, Wordpress, Drupal
    Databases
    Relational: SQL Server, PostgreSQL, MySQL, Oracle, BigQuery, Hive, Azure SQL DB, AWS RDS
    Non-relational: HBase, DynamoDB, Cassandra, MongoDB / MongoDB Atlas
    Graph-oriented: Neo4J
    Time-series: InfluxDB
    Data Engineering
    ETLs: Ab Initio, Talend, Knime, Informatica, Google Dataflow
    Languages: Python, Spark-Scala, PySpark / SparkSQL, R / R Shiny App
    Data platforms: Databricks, Dataiku, Datadog
    Data warehouses: AWS Redshift, Azure Synapse, Google BigQuery
    Architecture patterns: Data lakes, Data warehouses, Lakehouses (Delta Lakes), Data Mesh
    MDM & Governance: Informatica, EBX Object Storage: AWS S3, Azure Blob Storage, GCP Cloud Storage
    Data Science
    Concepts: Supervised learning, Unsupervised learning, Semi-supervised learning, Reinforcement learning
    Algorithms: classification, regression (linear, multi-variate, logistic), neural networks
    IDE: Jupyter Notebook, Atom, IntelliJ, Anaconda, PyCharm, Eclipse, Zeppelin, Spyder
    Languages: Python, R
    APM - Application Performance Monitoring
    Logs: Syslog, Logstash, Sciencelogic, Prometheus
    Tooling: Dynatrace, Splunk, Elastic APM, Datatog, Grafana, LogicMonitor
    APIM
    Protocols: RESTful, SOAP
    Tooling: 3Scale, Apigee, Postman
    Data visualization
    Tooling: PowerBI, Tableau, Kibana, D3.js, Excel, Spotfire
    Virtualization
    Tooling: Vmware, Veeam, Vagrant, Virtualbox, Azure VMs, Google Compute Engine, AWS EC2, AWS ECS
    Project workflow
    Tooling: Jira, Confluence, Trello, Sharepoint
    Deployment / Automation
    CI/CD : Git / GitHub / GitLab / GitHub Actions, Jenkins, Ansible, Chef, Airflow
    Containerization: Docker
    Hyperscalers
    AWS ****
    Azure ***
    GCP **
    Indexation / Query Engine
    Elasticsearch
    Apache Solr
    Impala
    Big Data
    Hadoop : Cloudera (CDH, CDP), MapR, Hortonworks (HDP)
    Apache Solr

  • Interests

    My main interest concerns Distributed Systems from a middleware perspective, by helping IT companies facing challenges on both technical and functional levels.

    Certifications


    Professional Certification - Big Data & Artificial Intelligence

    isaac-arnault-hortonworks isaac-arnault-hortonworks

    IT Leadership - Management - Coaching certifications

    IT Project Management - Customer Engagement - Agile certifications

    Architecture - Microservices certifications

    Data Management - Data Warehousing - Data Governance certifications

    Data Protection - Privacy Engineering - Data clearance certifications

    ETL & Data Processing - Data Analysis & visualization certifications

    Cloud Computing - Virtualization certifications

    Mainframe certifications

    Data Science - Machine Learning - Deep Learning certifications

    Big Data - Hadoop certifications

    Spark - Scala certifications

    User Experience - Product onboarding certifications

    Enterprise Resource Plannning certifications

    MDM & Data Governance

    Isaac Arnault x github.io ©2024