Thursday, January 16, 2025

Azure Blob Storage – Complete Overview

 



🔷 What is Azure Blob Storage?

Azure Blob Storage is Microsoft Azure’s object storage solution for the cloud, designed to store massive volumes of unstructured data such as text or binary files.

Blob = Binary Large Object

  • Data is structured as:
    Storage Account → Containers → Blobs

  • Each blob is uniquely addressable via a REST-based URL.


🔷 Types of Azure Blobs

Blob TypeDescriptionUse Case
Block BlobsStore text and binary dataImages, videos, documents
Append BlobsOptimized for append operationsLogging, auditing
Page BlobsOptimized for random read/write operations (512-byte pages)Azure VM disks (VHD files)

🔷 When to Use Azure Blob Storage

Use Azure Blob Storage for:

  • Hosting media content (images, videos, audio)

  • Application logs and telemetry

  • Backups, snapshots, and archives

  • Static website hosting

  • CI/CD build artifacts storage

  • IoT and streaming data

  • Data lake for analytics and ML workloads


🔷 DevOps Engineer Use Cases

🛠️ Artifact Storage

  • Store build outputs like .jar, .zip, .dll, etc.

  • Integrates with Azure DevOps, GitHub Actions, Jenkins.

📦 Pipeline Integration Example

bash
az storage blob upload \ --account-name mystorageaccount \ --container-name artifacts \ --name app-build.zip \ --file ./builds/app-build.zip \ --auth-mode login

🔐 Environment Config & Secrets

  • Store configs per environment.

  • (Note: For secrets, prefer Azure Key Vault.)

📊 Logging & Monitoring

  • Centralized logging from Azure VMs, App Services.

  • Long-term storage of diagnostic logs.

💾 Disaster Recovery & Backup

  • Backup state files, infra scripts, app configurations.

  • Enable geo-redundant storage for resiliency.


🔷 Azure Blob Storage Tiers

TierUse CaseCost Characteristics
HotFrequently accessed dataHigh storage cost, low access cost
CoolInfrequently accessed data (≥30d)Lower storage cost, higher access cost
ArchiveRarely accessed data (≥180d)Lowest storage cost, high latency to access (hours)

🔁 Use lifecycle management policies to auto-transition blobs between tiers.


🔷 Redundancy & Replication Options

Redundancy OptionDescription
LRSLocally Redundant (within 1 datacenter)
ZRSZone-Redundant (across 3 Availability Zones)
GRSGeo-Redundant (replicated to secondary region)
RA-GRS / GZRSRead-access Geo + Zonal Redundancy for best resilience

🔷 Security Features

  • Encryption:

    • SSE with Microsoft-managed or Customer-managed Keys (CMK)

  • Private Endpoints:

    • Access Blob via Azure VNet—no internet exposure

  • Access Control:

    • RBAC via Azure AD

    • Shared Access Signatures (SAS)

    • Access Control Lists (ACLs)

  • Immutable Storage:

    • Time-based or Legal Hold for compliance (e.g., financial data retention)


🔷 Tools for Managing Azure Blob Storage

ToolUse
Azure PortalWeb-based management
Azure CLI / PowerShellScripting, automation
Azure Storage ExplorerGUI-based storage management
SDKs (Python, .NET, Java, Node.js)Developer integration

🔷 AWS Equivalent: Amazon S3

FeatureAzure Blob StorageAmazon S3
Object Storage
Access TiersHot, Cool, ArchiveStandard, Intelligent-Tiering, Glacier
Blob/Object TypesBlock, Append, PageStandard Objects
CLI ToolsAzure CLI, PowerShellAWS CLI
Lifecycle Management
RedundancyLRS, ZRS, GRS, GZRSStandard, One Zone-IA, Glacier
Versioning
Access ControlRBAC, SAS, ACLsIAM, Bucket Policies
CI/CD IntegrationAzure DevOps, GitHub ActionsCodePipeline, CodeBuild

Summary

FeatureDetails
Service NameAzure Blob Storage
TypeObject Storage for Unstructured Data
Primary Use CasesMedia hosting, backups, CI/CD artifacts, logging, analytics
DevOps IntegrationAzure CLI, Azure DevOps, Jenkins, GitHub Actions, Storage Explorer
TiersHot, Cool, Archive
RedundancyLRS, ZRS, GRS, GZRS
SecurityEncryption, RBAC, SAS, Private Endpoints, Immutable Storage
AWS EquivalentAmazon S3

No comments:

Post a Comment