Below you will find pages that utilize the taxonomy term “Cloud”
Posts
Scaling Up: A Guide to Building High-Volume Websites with Leading Cloud Platforms
The modern web demands websites capable of handling vast user bases, processing immense data volumes, and delivering unparalleled performance. Cloud platforms have emerged as essential tools for achieving this scalability, offering a robust infrastructure and a diverse set of features to empower website development. This article explores four leading cloud providers - AWS, GCP, Railway, Vercel, and Render - highlighting their strengths in building and scaling high-volume websites.
1. AWS: The Enterprise-Grade Solution
read morePosts
Securing Your Google Kubernetes Engine Clusters from a Critical Vulnerability
Google Kubernetes Engine (GKE) is a popular container orchestration platform that allows developers to deploy and manage containerized applications at scale. However, a recent security vulnerability has been discovered in GKE that could allow attackers to gain access to clusters and steal data or launch denial-of-service attacks.
The vulnerability is caused by a misunderstanding about the system:authenticated group, which includes any Google account with a valid login. This group can be assigned overly permissive roles, such as cluster-admin, which gives attackers full control over a GKE cluster.
read morePosts
How to Mitigate Intraday Settlement Risk
Navigating the Rapids: How to Mitigate Intraday Settlement Risk In the fast-paced world of finance, even minor hiccups can have significant consequences. One such risk, intraday settlement risk, poses a constant challenge for banks and financial institutions. But what exactly is it, and how can institutions effectively manage this risk?
Understanding Intraday Settlement Risk
Intraday settlement risk refers to the potential inability to meet payment obligations at the expected time within a single business day.
read morePosts
AWS Fargate vs. non-Fargate
Fargate vs. Non-Fargate: Choosing the Right Container Orchestration Strategy for Your Needs
In the age of cloud computing, containers have become the go-to solution for deploying and scaling applications. And when it comes to container orchestration on AWS, the two main options are Fargate and non-Fargate (which typically involves Amazon EC2 instances and Amazon ECS). But which one is right for you?
What is Fargate?
Fargate is a serverless compute engine for Amazon ECS that allows you to run containers without having to provision or manage underlying EC2 instances.
more detailsPosts
Google Cloud Run vs AWS App Runner
AWS App Runner and Google Cloud Run are two serverless computing platforms that can help you deploy and run containerized applications without having to worry about servers. Both platforms are relatively new, but they have quickly become popular choices for developers.
What are the similarities?
Both platforms are serverless, meaning that you don’t have to provision or manage servers. The platforms will automatically scale your application up or down based on demand, so you only pay for the resources that you use.
read morePosts
Google Cloud Dataflow and Azure Stream Analytics
Google Cloud Dataflow and Azure Stream Analytics are both cloud-based streaming data processing services. They offer similar features, but there are some key differences between the two platforms.
Dataflow is a unified programming model and a managed service for developing and executing a wide range of data processing patterns including ETL, batch computation, and continuous computation. It is designed to scale automatically based on the data processing needs. Dataflow also offers various security features including IAM (Identity and Access Management), encryption, and audit logging.
read morePosts
Machine Learning Ops (MLOps)
MLOps stands for Machine Learning Operations. It is a set of practices that combines machine learning, DevOps, and IT operations to automate the end-to-end machine learning lifecycle, from data preparation to model deployment and monitoring.
The goal of MLOps is to make it easier to deploy and maintain machine learning models in production, while ensuring that they are reliable and efficient. MLOps can help to improve the quality of machine learning models, reduce the time it takes to get them into production, and make it easier to scale machine learning applications.
read morePosts
GCP and Azure networking
Azure networking and GCP networking are both comprehensive cloud networking services that offer a wide range of features and capabilities. However, there are some key differences between the two platforms.
Azure networking offers a more traditional networking model, with a focus on virtual networks (VNets), subnets, and network security groups (NSGs). VNets are isolated networks that can be used to group together resources, such as virtual machines (VMs), storage, and applications.
read morePosts
BigQuery ML Example
Here is an example of how to use BigQuery ML on a public dataset to create a logistic regression model to predict whether a user will click on an ad:
# Import the BigQuery ML library from google.cloud import bigquery from google.cloud.bigquery import Model # Get the dataset and table dataset = bigquery.Dataset("bigquery-public-data.samples.churn") table = dataset.table("churn") # Create a model model = Model('my_model', model_type='logistic_regression', input_label_column='churn', input_features_columns=['tenure', 'contract', 'monthly_charges']) # Train the model model.
read morePosts
Monitor Costs in Azure
There are a few ways to monitor costs in Azure. One way is to use the Azure Cost Management + Billing portal. This portal provides a graphical interface that you can use to view your costs over time, track your spending against budgets, and identify areas where you can save money.
Another way to monitor costs is to use the Azure Cost Management API. This API allows you to programmatically access your cost data and integrate it with other systems.
read morePosts
MLOps with Kubeflow
Kubeflow is an open-source platform for machine learning and MLOps on Kubernetes. It provides a set of tools and components that make it easy to deploy, manage, and scale machine learning workflows on Kubernetes.
Kubeflow includes a variety of components, including:
Notebooks: A Jupyter notebook service that allows data scientists to develop and experiment with machine learning models.
Pipelines: A tool for building and deploying machine learning pipelines.
Experimentation: A tool for tracking and managing machine learning experiments.
read morePosts
confluent kafka vs apache beam
Confluent Kafka and Apache Beam are both open-source platforms for streaming data. However, they have different strengths and weaknesses.
Confluent Kafka is a distributed streaming platform that is used to store and process large amounts of data in real time. It is a good choice for applications that require high throughput and low latency. Kafka is also a good choice for applications that need to be fault-tolerant and scalable.
Apache Beam is a unified programming model for batch and streaming data processing.
read morePosts
AWS Lambda and GCP Cloud
AWS Lambda and Google Cloud Run are both serverless computing platforms that allow you to run code without provisioning or managing servers. However, there are some key differences between the two platforms:
Supported languages: AWS Lambda supports a wide range of programming languages including Node.js, Java, Python, Go, Ruby, and C#. Cloud Run supports Docker images, which can be written in any language. Cold start: When a Lambda function is first invoked, it takes a few milliseconds to start up.
read morePosts
Cloud gotchas 2
Serverless Serverless is great. You create your services and hand them over to AWS Lambda/GCP Cloud Run/Azure Functions and let them rip. Your system can scale up to hundreds of instances and quickly service your clients. However, you must consider
how will your downstream clients respond to such peaks in volume? Will they be able to cope? how must will auto-scaling cost? how portable is your code between serverless platforms? how will you handle bugs in the serverless platform?
read morePosts
Azure create K8 cluster
Here is a Terraform file that you can use to create a Kubernetes cluster in Azure:
provider "azurerm" { version = "~> 3.70.0" subscription_id = var.azure_subscription_id client_id = var.azure_client_id client_secret = var.azure_client_secret tenant_id = var.azure_tenant_id } resource "azurerm_resource_group" "aks_cluster" { name = var.resource_group_name location = var.location } resource "azurerm_kubernetes_cluster" "aks_cluster" { name = var.aks_cluster_name location = azurerm_resource_group.aks_cluster.location resource_group_name = azurerm_resource_group.aks_cluster.name node_count = 3 vm_size = "Standard_D2s_v3" network_profile { kubernetes_network_interface_id = azurerm_network_interface.
read morePosts
AWS vs Azure vs GCP
AWS, Azure, and GCP are the three leading cloud computing platforms in the market. They offer a wide range of services, including compute, storage, databases, networking, machine learning, and artificial intelligence.
Here are some of the key differences between the three platforms:
Market share: AWS is the market leader, with a 33% market share in 2022. Azure is second with a 22% market share, and GCP is third with a 9% market share.
read morePosts
Cloud gotchas 1
Since 2017 I’ve been involved in a wide variety of “cloud” projects and there’s some common myths I’ve observed.
Migrations are just containers Change is hard and unless you’re working for a startup, most cloud transformations start as lift and shift exercises. Contracts have been signed and everyone has been sold the myth that all you need to do is “dockerise” your containers and away you go.
Unfortunately, most of the hyperscalers (cloud provider - GCP, AWS, Azure, etc) will dazzle you with the way they’ve been doing things for years and just tell you and will instruct you to “do as they say”.
read morePosts
BigQuery ML and Vertex AI Generative AI
BigQuery ML and Vertex AI Generative AI (GenAI) are both machine learning (ML) services that can be used to build and deploy ML models. However, there are some key differences between the two services.
BigQuery ML: BigQuery ML is a fully managed ML service that allows you to build and deploy ML models without having to manage any infrastructure. BigQuery ML uses the same machine learning algorithms as Vertex AI, but it does not offer the same level of flexibility or control.
read morePosts
Predict the stock market
The premise was simple. Use “big” data analytics and machine learning models to predict the movement of stock prices. However, we had really “dirty” data and our Data Scientists were stuggling to seperate the noise from the signals. We spent a lot of time cleaning the data and introducing good old principles like “how can I run the model somewhere over than a laptop?”. This was a true startup, a bunch of people in a room trying to get stuff working.
read morePosts
Pushing the limits of the Google Cloud Platform
This one is better explained with the presentation below. If you want to learn how to run quantitative analytics at scale, it’s well worth a watch.
read more