Serverless Data Pipeline
Data Engineering
- Enables the data-to-decision journey: from raw data to dashboard
- ELT: raw data, ingestion, transformation, aggregation
Google Cloud services
![Google Cloud services Cloud Storage](https://fractal.ai/wp-content/uploads/2023/04/cloud_storage.png)
Cloud Storage
![Google Cloud services Cloud functions](https://fractal.ai/wp-content/uploads/2023/04/cloud_functions.png)
Cloud Functions
![Google Cloud services Big Query](https://fractal.ai/wp-content/uploads/2023/04/bigquery.png)
BigQuery
![Google Cloud services Cloud Build](https://fractal.ai/wp-content/uploads/2023/04/cloud_build.png)
Cloud Build
![Google Cloud services Cloud deployment manager](https://fractal.ai/wp-content/uploads/2023/04/cloud_deployment_manager.png)
Deployment Manager
![Google Cloud services Cloud Monitoring](https://fractal.ai/wp-content/uploads/2023/04/cloud_monitoring.png)
Cloud Monitoring
![Google Cloud services Cloud Logging](https://fractal.ai/wp-content/uploads/2023/04/cloud_logging.png)
Cloud Logging
![image-39.png](https://fractal.ai/wp-content/uploads/2023/04/image-39.png)
Cloud Source Repositories
Features
![Group-48095549-1.png](https://fractal.ai/wp-content/uploads/2023/04/Group-48095549-1.png)
- Serverless – No Infra provisioning
- Auto Scalable – Scales up and down as per data size
- Deployment – Managed and automated
- Security – Access-controlled & Encryption-enabled
- Performance – Can process GBs of data in seconds
- File types supported: CSV, JSON
- Low to medium complexity of data movements
- Medium data volumes
- Incoming data of up to 5 GB per file
- Transformation query run time of no more than 5 minutes
- Few interdependent source files
- Up to 200 source files per day
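The sizing limits in the list above can be turned into a quick pre-check before onboarding a workload. The following is a minimal sketch, not part of the solution itself: the `Workload` class, the constant names, and the `fit_issues` function are hypothetical, and "5 GB" is assumed to mean 5 × 1024³ bytes.

```python
from dataclasses import dataclass

# Limits copied from the list above. All names here are hypothetical.
SUPPORTED_EXTENSIONS = {".csv", ".json"}
MAX_FILE_BYTES = 5 * 1024**3   # 5 GB per incoming file (assumed binary GB)
MAX_QUERY_SECONDS = 5 * 60     # transformation query run time
MAX_FILES_PER_DAY = 200


@dataclass
class Workload:
    file_names: list             # source files expected on a typical day
    largest_file_bytes: int      # size of the largest incoming file
    longest_query_seconds: float # slowest transformation query


def fit_issues(workload):
    """Return the reasons a workload falls outside the stated limits."""
    issues = []
    unsupported = sorted({
        name for name in workload.file_names
        if not any(name.lower().endswith(ext) for ext in SUPPORTED_EXTENSIONS)
    })
    if unsupported:
        issues.append(f"unsupported file types: {unsupported}")
    if workload.largest_file_bytes > MAX_FILE_BYTES:
        issues.append("a file is larger than 5 GB")
    if workload.longest_query_seconds > MAX_QUERY_SECONDS:
        issues.append("a transformation query runs longer than 5 minutes")
    if len(workload.file_names) > MAX_FILES_PER_DAY:
        issues.append("more than 200 source files per day")
    return issues
```

An empty list means the workload sits inside every limit; each entry otherwise names one limit that is exceeded.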
![Group-48095556.png](https://fractal.ai/wp-content/uploads/2023/04/Group-48095556.png)
Adoption
Operationalization
![Frame-4379-1.png](https://fractal.ai/wp-content/uploads/2023/04/Frame-4379-1.png)
Deployment – Automated through Deployment Manager.
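Deployment Manager accepts Python templates that return a dictionary of resources. The sketch below shows roughly what such a template could look like for a landing bucket and a raw dataset. The resource names, the `location` property, and the choice of resources are assumptions for illustration, not the solution's actual template.

```python
from types import SimpleNamespace  # only needed for the local check below


def generate_config(context):
    """Deployment Manager template entry point (illustrative resources only)."""
    prefix = context.env["deployment"]
    location = context.properties.get("location", "US")

    resources = [
        {
            # Hypothetical landing bucket for incoming files
            "name": f"{prefix}-landing",
            "type": "storage.v1.bucket",
            "properties": {"location": location},
        },
        {
            # Hypothetical dataset for ingested tables; dataset IDs allow
            # letters, digits and underscores, so hyphens are replaced.
            "name": f"{prefix}-raw-dataset",
            "type": "bigquery.v2.dataset",
            "properties": {
                "datasetReference": {
                    "datasetId": prefix.replace("-", "_") + "_raw",
                },
                "location": location,
            },
        },
    ]
    return {"resources": resources}


# Local check with a stand-in for the context object that
# Deployment Manager passes to the template at deploy time.
demo_context = SimpleNamespace(
    env={"deployment": "sales-pipeline"},
    properties={"location": "EU"},
)
demo_config = generate_config(demo_context)
```

Keeping the resources in a template means the same definition can be deployed repeatedly to different projects by changing only the properties.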
![Frame-4379-2.png](https://fractal.ai/wp-content/uploads/2023/04/Frame-4379-2.png)
Ingestion – Automated for CSV and JSON. The schema is auto-detected and updated automatically to match incoming files.
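One way to picture the ingestion step is a function that turns a Cloud Storage event into a BigQuery load job configuration with schema auto-detection switched on. This is a sketch under assumptions: the function name, the rule that the table name comes from the file name, and the append behavior are hypothetical. The dictionary keys follow the load configuration of the BigQuery jobs REST API.

```python
import os

# File extension -> BigQuery source format. JSON files are assumed to be
# newline-delimited, which is the JSON layout BigQuery load jobs accept.
SOURCE_FORMATS = {".csv": "CSV", ".json": "NEWLINE_DELIMITED_JSON"}


def build_load_config(event, project_id, dataset_id):
    """Map a Cloud Storage event ("bucket", "name") to a load job body."""
    name = event["name"]
    stem, ext = os.path.splitext(os.path.basename(name))
    source_format = SOURCE_FORMATS.get(ext.lower())
    if source_format is None:
        raise ValueError(f"unsupported file type: {name}")

    load = {
        "sourceUris": [f"gs://{event['bucket']}/{name}"],
        "destinationTable": {
            "projectId": project_id,
            "datasetId": dataset_id,
            "tableId": stem,  # assumption: one table per file name
        },
        "sourceFormat": source_format,
        "autodetect": True,                              # infer the schema
        "writeDisposition": "WRITE_APPEND",
        "schemaUpdateOptions": ["ALLOW_FIELD_ADDITION"],  # accept new columns
    }
    return {"configuration": {"load": load}}
```

A Cloud Function triggered by the bucket could submit the returned body as a load job. The submission call is left out here because it depends on the client library in use.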
![Frame-4379-3.png](https://fractal.ai/wp-content/uploads/2023/04/Frame-4379-3.png)
Transformation – Uses BigQuery SQL queries, which are then orchestrated through Composer DAGs.
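Orchestration here means running each SQL transformation only after the tables it reads have been built. The sketch below illustrates that ordering with Python's standard `graphlib` module as a stand-in for a Composer DAG. It is not Airflow code, and the step names, table names, and SQL are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical transformation steps: each has a query and the steps it needs.
STEPS = {
    "clean_orders": {
        "sql": "CREATE OR REPLACE TABLE curated.orders AS "
               "SELECT * FROM raw.orders WHERE order_id IS NOT NULL",
        "depends_on": [],
    },
    "clean_customers": {
        "sql": "CREATE OR REPLACE TABLE curated.customers AS "
               "SELECT DISTINCT * FROM raw.customers",
        "depends_on": [],
    },
    "daily_revenue": {
        "sql": "CREATE OR REPLACE TABLE marts.daily_revenue AS "
               "SELECT o.order_date, c.region, SUM(o.amount) AS revenue "
               "FROM curated.orders AS o "
               "JOIN curated.customers AS c USING (customer_id) "
               "GROUP BY o.order_date, c.region",
        "depends_on": ["clean_orders", "clean_customers"],
    },
}


def execution_order(steps):
    """Return step names so that every step comes after its dependencies."""
    graph = {name: set(step["depends_on"]) for name, step in steps.items()}
    return list(TopologicalSorter(graph).static_order())


order = execution_order(STEPS)  # "daily_revenue" always comes last
```

In a Composer DAG the same dependencies would be declared between tasks, and the scheduler would also handle retries and run independent steps in parallel.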
Customization
Code can be extended to support other file formats.
Any built-in generalization can be switched off or extended to meet specific requirements.
Can be extended to accommodate requirements for customer-managed encryption keys (CMEK).
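As an illustration of the CMEK extension, a BigQuery load job can name a Cloud KMS key for its destination table through the `destinationEncryptionConfiguration` field of the jobs REST API. The helper below is a minimal sketch. The function name and the shape of the incoming configuration are assumptions, and the key name must refer to a key that BigQuery is permitted to use.

```python
import copy
import re

# Cloud KMS key resource name:
# projects/<project>/locations/<location>/keyRings/<ring>/cryptoKeys/<key>
KMS_KEY_PATTERN = re.compile(
    r"^projects/[^/]+/locations/[^/]+/keyRings/[^/]+/cryptoKeys/[^/]+$"
)


def with_cmek(job_body, kms_key_name):
    """Return a copy of a load job body whose destination table uses CMEK."""
    if not KMS_KEY_PATTERN.match(kms_key_name):
        raise ValueError(f"not a Cloud KMS key resource name: {kms_key_name}")
    secured = copy.deepcopy(job_body)  # leave the caller's dict untouched
    secured["configuration"]["load"]["destinationEncryptionConfiguration"] = {
        "kmsKeyName": kms_key_name,
    }
    return secured
```

Returning a copy keeps the default, Google-managed configuration available for pipelines that do not need CMEK.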
Benefits
- Faster onboarding on Google Cloud means faster time to market
- Decreases ramp-up time by 4–8 weeks
- Standardization of solutions leads to ease of maintenance
- Configuration-driven design lets businesses deploy changes faster
- Out-of-the-box solutions for common tasks reduce effort
- Better risk management leads to more predictable outcomes
Use cases
![business-concept-business-people-teamwork-discussing-1.png](https://fractal.ai/wp-content/uploads/2023/04/business-concept-business-people-teamwork-discussing-1.png)