Big Data Technologies in the Cloud Training

Level

Intermediate

Duration

24h / 3 days

Date

Individually arranged

Price

Individually arranged

Big Data Technologies in the Cloud Training

The Big Data Technologies in the Cloud training is an intensive, practical workshop designed for engineers, system administrators, and IT professionals who want to learn how to effectively build and manage modern Big Data infrastructure in cloud environments (AWS, Azure, Google Cloud). Participants will become familiar with key technologies, tools, and architectural patterns used for storing, processing, and analyzing large volumes of data in the cloud. The course combines up-to-date theoretical knowledge with a dominant practical component based on real project scenarios.

What You Will Learn

  • You will learn the architectures and capabilities of major Big Data and cloud technologies
  • You will master effective techniques for storing, securing, and processing large datasets using tools like S3, Hadoop, Spark, and NoSQL
  • You will gain skills in automating data integration and analysis using dedicated cloud services
  • You will acquire practical experience in designing, deploying, and optimizing Big Data solutions in cloud environments
Who is this training for?
  • logo infoshare Software engineers and system administrators implementing or maintaining Big Data solutions in the cloud
  • logo infoshare Data analysts and Data Science professionals who want to increase their competencies in data processing and analysis
  • logo infoshare Individuals planning to migrate existing systems or deploy new projects based on Big Data and public or hybrid cloud environments
  • logo infoshare IT solution architects who want to implement modern, scalable data platforms

Training Program

  1. Day 1: Big Data Fundamentals in the Cloud

  • Module 1: Basics of Big Data and Cloud

    • Introduction to the Big Data concept and the 5Vs (volume, velocity, variety, veracity, value)
    • Overview of cloud types: IaaS, PaaS, SaaS
    • Main cloud service providers
    • Fundamental AWS services

      • Compute: EC2, Lambda
      • Storage: S3, EBS, Glacier
      • Networking: VPC, Internet Gateways
    • Monitoring: CloudWatch, CloudTrail
    • Security and identity management: IAM
  • Module 2: Data Lake Architecture and Data Storage

    • Building a Data Lake in the cloud
    • Amazon S3, Azure Blob Storage, Google Cloud Storage
    • Managing permissions, data versioning, and data security
  1. Day 2: Data Processing and Analysis

  • Module 3: Processing Large Data Sets

    • Distributed file systems in the cloud (HDFS, S3 integration)
    • Core processing engines

      • Hadoop
      • Spark (AWS EMR, Azure Databricks)
      • MapReduce
      • YARN
  • Module 4: NoSQL Databases and Data Warehouses

    • NoSQL technologies: HBase, Cassandra, MongoDB in the cloud
    • Data warehouses: Amazon Redshift, Google BigQuery, Azure Synapse Analytics
  1. Day 3: Advanced Technologies and Case Studies

  • Module 5: Data Integration, Orchestration, and Automation

    • Data integration services: AWS Glue, Azure Data Factory, Google Dataflow
    • Workflow orchestration and automation: Oozie, AWS Step Functions
  • Module 6: Analysis, Visualization, and Security

    • Cloud data analysis: Athena, BigQuery, Spark SQL
    • Data visualization: Jupyter Notebook, Zeppelin, BI tools
    • Security: secret storage, access auditing, compliance (IAM, Azure Key Vault)

Contact us

we will organize training for you tailored to your needs

Przemysław Wołosz

Key Account Manager

przemyslaw.wolosz@infoShareAcademy.com

    The controller of your personal data is InfoShare Academy Sp. z o.o. with its registered office in Gdańsk, al. Grunwaldzka 427B, 80-309 Gdańsk, KRS: 0000531749, NIP: 5842742121. Personal data are processed in accordance with information clause.