sanj.dev

HOW TO UNDELETE BIGQUERY TABLES USING TIME TRAVEL

One hour ago: bq cp mydataset.table@-3600000 mydataset.table_restored Absolute (ms since UNIX epoch) GMT: Wednesday, 26 May 2021 13:41:53 = 1622036513000 https://www.epochconverter.com/

11/1/2022 gcp bigquery cloud data tutorial

JAVA 17 LTS: ENTERPRISE FEATURES GUIDE

Major Features in Java 17 LTS Enhanced Random Number Generation Pseudo-Random Number Generators (PRNGs) are getting a major update in Java with the release of JEP 356. New interfaces and implementations make it easier to use different algorithms interchangeably and offer better support for stream-based programming. This is a great improvement for Java developers who require randomness in their applications.

15/12/2021 java jvm enterprise security programming

CREATING BLOG SERIES IN HUGO: PART 2

In this second part of the series we’ll show you where to find the full series

15/8/2021 hugo tutorial development content web

CREATING BLOG SERIES IN HUGO: PART 1

In this first part of the series we’ll show you how to create a series

14/8/2021 hugo tutorial development content web

SETTING UP PAGE REDIRECTS IN HUGO

Forwarding to gohugo using redirectUrl

20/6/2021 web hugo tutorials frontend development

KAFKA VS GOOGLE CLOUD PUBSUB: 2020 COMPARISON

Feature Confluent Cloud Kafka Google Cloud Pubsub Notes Data Retention Set retention per topic in Confluent Cloud, including unlimited retention with log compaction. Retains unacknowledged messages in persistent storage for 7 days from the moment of publication. There is no limit on the number of retained messages. Have to write custom subscriber/publisher to save beyond 7 days [L] + ongoing BAU [S] Replay A consumer request an “offset”, however the retention period is dictated by the broker config “Snapshots” can be created for later replay by these are limited to 7 days as per retention policy. As per above, a custom subscriber/publisher to save/replay message. [L] + ongoing BAU [S] Message Ordering Yes, within a partition, In general, messages are written to the broker in the same order that they are received by the producer client. No, Pub/Sub provides a highly-available, scalable message delivery service. The tradeoff for having these properties is that the order in which messages are received by subscribers is not guaranteed. While the lack of ordering may sound burdensome, there are very few use cases that actually require strict ordering. Delivery Semantics Exactly-once delivery semantics At-least-once, exactly once possible with dataflow Latency Advertised at being able to “Achieve sub 30 ms latency at scale”, no mention of this in SLA. Pub/Sub does not guarantee message delivery latency Uptime “Service Level” Monthly Uptime Percentage of at least 99.95%.Is this 99.95% of GCP’s 99.95%? “Service Level Objective” Monthly Uptime Percentage to Customer of at least 99.95% Both offer credits to account if not met but this is unlikely to be suitable for an enterprise org Schema Registry Yes for Avro schemas and very new (GA August 2019) No, Data Catalog is in beta and could be used to build one [XL] IAM / ACL “Preview” for Role/SAML, ACLs You are provided auth keys that you need to store/share/rotate. These could be stored in Cloud KMS, however, this would need to be automated [XL] + ongoing BAU [M] Standard Google IAM Encryption Yes in transit and at rest with NO payload encryption. Clients are responsible for writing custom encryption/decryption connecting to (e.g) Cloud KMS. A custom library would need to be written that is used by everyone for publishing and subcribing [L] Cloud KMS(HSM/Software/BYOK/External Key Manager) with CMEK VPC Security Unknown/No, can Confluent Cloud be made to respect VPC service controls? VPC Service Controls protection applies to all push and pull operations except existing Pub/Sub push subscriptions Stream Processing “fully-managed KSQL”, no kafka streams, would have to run Kafka Streams/Storm cluster connecting to Confleunt Cloud which is likely to introduce latency Apache Beam / Cloud Dataflow fully managed Costs per 130GB $37 $39 Based on example calc on confluent and google pricing calc Priority Queues Yes No, but can segregate by topic Multi-zone high availability Not advertised “Contact Confluent” Yes Related Articles Building a Modern Data Lakehouse: Patterns, Tools, and Real-World Workflows for 2025 DuckLake & Iceberg: Modern Lakehouse Architecture 2025

28/12/2020 cloud gcp messaging systemdesign comparison

FINDING GOOGLE CLOUD IP RANGES: REFERENCE GUIDE

GCP Cloud IPs by region IP range with geolocation More info here

12/10/2020 gcp networking cloud security comparison

MULTI-LANGUAGE PIPELINES WITH APACHE BEAM

Apache Beam is an open-source unified programming model and framework for defining and executing big data processing pipelines. It provides a way to write data processing code that is portable across different execution engines or runtimes, such as Apache Flink, Apache Spark, Google Cloud Dataflow, and more.

25/8/2020 big-data streaming cloud architecture development

GETTING STARTED WITH TERRAFORM CLOUD DEVELOPMENT KIT

Terraform’s Cloud Development Kit (CDK) let’s you use other languages to define your cloud infra.

25/7/2020 terraform devops infrastructure cloud development

TERRAFORM 0.13: KEY FEATURES AND IMPROVEMENTS

No more copying and pasting modules.

24/6/2020 terraform devops infrastructure cloud development

…