Posts

Google Cloud Dataflow and Azure Stream Analytics

Google Cloud Dataflow and Azure Stream Analytics are both cloud-based streaming data processing services. They offer similar features, but there are some key differences between the two platforms.

Dataflow is a unified programming model and a managed service for developing and executing a wide range of data processing patterns including ETL, batch computation, and continuous computation. It is designed to scale automatically based on the data processing needs. Dataflow also offers various security features including IAM (Identity and Access Management), encryption, and audit logging.

Posts

Beyond Bash: Exploring Modern Rust-based Command-Line Utilities

Rust has emerged as a powerhouse for building robust, lightning-fast, and secure software. Its influence extends beyond web applications and systems programming—it’s also gaining traction in the realm of command-line utilities.

Here’s a curated list of Rust-powered command-line tools that can streamline your workflow, enhance productivity, and make your life in the terminal a breeze:

Text Manipulation

bat : A “cat” clone with wings, featuring syntax highlighting, Git integration, and automatic paging for seamless viewing of text files.
ripgrep: Lightning-fast, cross-platform text search tool with features like regex support, multi-threaded execution, and smart case-sensitivity handling.
fd: A blazing-fast alternative to “find”, offering intuitive filtering, intelligent search defaults, and seamless integration with other Rust tools.
dust: https://github.com/bootandy/dust): A human-friendly “du” replacement for better disk usage visualization, with color-coded output and clear information hierarchy.

File Management

exa: A modern “ls” alternative with a visually appealing interface, long file name support, Git integration, and customizable display options.
broot: A reimagined file browser for efficient navigation and file management, offering filtering, multi-pane views, and seamless integration with other tools.
delta: A syntax-highlighting pager for viewing “git diff” output, “git show” output, and plain text patches with clarity.

Development

cargo: The heart of Rust development, handling package management, dependency resolution, and project building.
rustfmt: Ensures consistent code formatting across your Rust codebase, adhering to community-defined style guidelines.
clippy: A linter offering helpful suggestions and code improvements to enhance code quality and maintainability.

Networking

hyperfine: A benchmarking tool for comparing the performance of different commands and scripts, providing accurate insights for optimization.
miniserve: A straightforward tool for serving static files over HTTP, ideal for quick file sharing and testing.

And many more!

The Rust ecosystem is brimming with innovative command-line tools. Explore and discover those that align with your specific needs and preferences!

more details

Posts

Java VMs

OpenJDK is the reference implementation of the Java Virtual Machine (JVM). It is free and open-source, and is available for a variety of platforms.
Oracle JDK is a commercial implementation of the JVM. It is developed and supported by Oracle, and includes additional features and performance optimizations over OpenJDK.
GraalVM is a high-performance JVM that is designed for modern applications. It includes a number of features that can improve the performance of Java applications, such as ahead-of-time (AOT) compilation and native image generation.
IKVM.NET is a JVM that runs on the .NET Framework. It allows Java code to be run on .NET-based platforms, such as Windows and macOS.
JRockit is a JVM that is designed for high-performance computing applications. It includes a number of features that can improve the performance of Java applications, such as garbage collection tuning and thread affinity.
Azul VM is a high-performance Java Virtual Machine (JVM) developed by Azul Systems. It is based on the OpenJDK codebase, but includes a number of performance optimizations and features that can improve the performance of Java applications.

These are just a few of the many VMs that are available for Java. The best VM for a particular application will depend on the specific requirements of the application.

Posts

Machine Learning Ops (MLOps)

MLOps stands for Machine Learning Operations. It is a set of practices that combines machine learning, DevOps, and IT operations to automate the end-to-end machine learning lifecycle, from data preparation to model deployment and monitoring.

The goal of MLOps is to make it easier to deploy and maintain machine learning models in production, while ensuring that they are reliable and efficient. MLOps can help to improve the quality of machine learning models, reduce the time it takes to get them into production, and make it easier to scale machine learning applications.

Posts

GCP and Azure networking

Azure networking and GCP networking are both comprehensive cloud networking services that offer a wide range of features and capabilities. However, there are some key differences between the two platforms.

Azure networking offers a more traditional networking model, with a focus on virtual networks (VNets), subnets, and network security groups (NSGs). VNets are isolated networks that can be used to group together resources, such as virtual machines (VMs), storage, and applications. Subnets are smaller subdivisions of a VNet, and they can be used to further isolate resources. NSGs are used to control traffic flow within and between VNets.

Posts

Crypto liquidity

The liquidity of crypto markets is a measure of how easily you can buy or sell a cryptocurrency without affecting its price. A liquid market means that there are many buyers and sellers, so you can easily find someone to take the other side of your trade. An illiquid market means that there are few buyers and sellers, so it can be difficult to find someone to trade with and the price of the cryptocurrency may be more volatile.

Posts

BigQuery ML Example

Here is an example of how to use BigQuery ML on a public dataset to create a logistic regression model to predict whether a user will click on an ad:

# Import the BigQuery ML library
from google.cloud import bigquery
from google.cloud.bigquery import Model

# Get the dataset and table
dataset = bigquery.Dataset("bigquery-public-data.samples.churn")
table = dataset.table("churn")

# Create a model
model = Model('my_model',
              model_type='logistic_regression',
              input_label_column='churn',
              input_features_columns=['tenure', 'contract', 'monthly_charges'])

# Train the model
model.train(table)

# Make a prediction
prediction = model.predict(STRUCT(tenure=12, contract='month-to-month', monthly_charges=100))

# Print the prediction
print(prediction)

This code will first create a logistic regression model named my_model. The model will be trained on a public dataset called bigquery-public-data.samples.churn. The churn dataset contains data about customer churn, with the churn column indicating whether a customer has churned. The tenure, contract, and monthly_charges columns are the input features columns.

Posts

Cryptofeed and XChange

Cryptofeed and XChange are both Python libraries that provide access to cryptocurrency exchange data. However, there are some key differences between the two projects.

Cryptofeed is more mature and has a wider range of supported exchanges. Cryptofeed currently supports over 40 exchanges, while XChange only supports a handful. Cryptofeed also has a more comprehensive set of features, including support for websockets, book validation, and multiple data formats.
XChange is more lightweight and easier to use. XChange is a simpler library that is easier to get started with. It does not have as many features as Cryptofeed, but it is still a powerful tool for accessing cryptocurrency exchange data.
Cryptofeed is more actively maintained. The Cryptofeed project is more actively maintained than XChange. This means that there are more frequent releases and bug fixes for Cryptofeed.

Overall, Cryptofeed is a more powerful and mature library for accessing cryptocurrency exchange data. However, XChange is a simpler and easier-to-use library that may be a better choice for some users.

Posts

Updating a BigQuery schema with nested fields/records

Have the producers changed the message again? Here’s how you can update your BQ schema:

# Set GCP_PROJECT and BQ_TABLE to the appropriate values
# and make sure you are authenticated.
export GCP_PROJECT=MyGcpProject
# https://cloud.google.com/bigquery/docs/reference/bq-cli-reference#resource_specification
export BQ_TABLE=myProject:myDataset.myTable

gcloud config set project ${GCP_PROJECT}

bq show \
--schema \
--format=prettyjson \
${BQ_TABLE} > schema.json

# Replace vi with your fav :-)
vi schema.json

bq update ${BQ_TABLE} schema.json

Posts

Understanding Crypto Market Makers

Here are some of the bigger crypto market makers:

Genesis Global Trading: Genesis Global Trading is a leading digital asset market maker, providing liquidity to institutions and professional traders around the world.
BitMEX: BitMEX is a cryptocurrency exchange that offers margin trading and other derivatives products. It is one of the largest cryptocurrency exchanges in terms of trading volume.
Binance: Binance is another large cryptocurrency exchange that offers a variety of trading products, including spot trading, margin trading, and futures trading. [Image of Binance logo]
Kraken: Kraken is a cryptocurrency exchange that is known for its high liquidity and security. It offers a variety of trading products, including spot trading, margin trading, and futures trading.
Coinbase: Coinbase is one of the most well-known cryptocurrency exchanges. It offers a variety of trading products, including spot trading, margin trading, and staking.
Huobi Global: Huobi Global is a cryptocurrency exchange that is known for its high liquidity and global reach. It offers a variety of trading products, including spot trading, margin trading, and futures trading.
FTX: FTX is a cryptocurrency exchange that is known for its innovative products, such as its futures contracts and leveraged tokens.
Bybit: Bybit is a cryptocurrency exchange that is known for its high leverage trading products.

These are just a few of the many crypto market makers that exist. The size and prominence of market makers can vary depending on the cryptocurrency and the exchange. It is important to do your own research before choosing a market maker to trade with.