Blog Archives

Data science with Microsoft Fabric – Plotting ROC curve and distribution of scores

ROC (Receiver Operation Characteristics) – curve is a graph that shows how classifiers performs by plotting the true positive and false positive rates. It is used to evaluate the performance of binary classification models by illustrating the trade-off between True

Tagged with: , , , , , , , , ,
Posted in Azure Machine Learning, Fabric, R, Spark

Data engineering functions on large datasets in Microsoft Fabric

Data engineering and even simple data wrangling functions in Fabric can make several tasks faster, when you know know, which package (language) to choose. By comparing Python Pandas with PySpark Pandas (Koalas), we will see that there are huge performance

Tagged with: , , , , , , , ,
Posted in Fabric, Spark

Comparing performances of CSV to RDS, Parquet, and Feather file formats in R

From the previous blogpost:– CSV or alternatives? Exporting data from SQL Server data to ORC, AVRO, Parquet, Feather files and store them into Azure data lake we have created Azure blob storage, connected secure connection using Python and started uploading files

Tagged with: , , , , , , ,
Posted in Uncategorized

SQL vs. NoSQL for Data Science

Data come in variety of form, at different pace, and at different volume. And if all three criteria define the difference between SQL and NoSQL and there, all three are still irrelevant for data science.

My theorem is, that no matter what shape, size, frequeny, value and trustworthiness, SQL type of presenting the data is still the number one player.

Tagged with: , , , , , , ,
Posted in thoughts, Uncategorized

Advent of 2021, Day 25 – Spark literature, documentation, courses and books

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , , , , , , , ,
Posted in Azure Databricks, Spark, Uncategorized

Advent of 2021, Day 24 – Data Visualisation with Spark

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , , , , , , ,
Posted in Spark, Uncategorized

Advent of 2021, Day 23 – Delta live tables with Azure Databricks

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , , , , ,
Posted in Azure Databricks, Spark, Uncategorized

Advent of 2021, Day 22 – Spark in Azure Databricks

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , , , , ,
Posted in Azure Databricks, Spark, Uncategorized

Advent of 2021, Day 21 – Spark GraphX operators

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , ,
Posted in Spark, Uncategorized

Advent of 2021, Day 20 – Spark GraphX processing

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec

Tagged with: , , , , ,
Posted in Spark, Uncategorized
Follow TomazTsql on WordPress.com
Programs I Use: SQL Search
Programs I Use: R Studio
Programs I Use: Plan Explorer
Rdeči Noski – Charity

Rdeči noski

100% of donations made here go to charity, no deductions, no fees. For CLOWNDOCTORS - encouraging more joy and happiness to children staying in hospitals (http://www.rednoses.eu/red-noses-organisations/slovenia/)

€2.00

Top SQL Server Bloggers 2018
TomazTsql

Tomaz doing BI and DEV with SQL Server and R, Python, Power BI, Azure and beyond

Discover WordPress

A daily selection of the best content published on WordPress, collected for you by humans who love to read.

Revolutions

Tomaz doing BI and DEV with SQL Server and R, Python, Power BI, Azure and beyond

Reeves Smith's SQL & BI Blog

A blog about SQL Server and the Microsoft Business Intelligence stack with some random Non-Microsoft tools thrown in for good measure.

SQL Server

for Application Developers

Business Analytics 3.0

Data Driven Business Models

SQL Database Engine Blog

Tomaz doing BI and DEV with SQL Server and R, Python, Power BI, Azure and beyond

Search Msdn

Tomaz doing BI and DEV with SQL Server and R, Python, Power BI, Azure and beyond

R-bloggers

Tomaz doing BI and DEV with SQL Server and R, Python, Power BI, Azure and beyond

Data Until I Die!

Data for Life :)

Paul Turley's SQL Server BI Blog

sharing my experiences with the Microsoft data platform, SQL Server BI, Data Modeling, SSAS Design, Power Pivot, Power BI, SSRS Advanced Design, Power BI, Dashboards & Visualization since 2009

Grant Fritchey

Intimidating Databases and Code

Madhivanan's SQL blog

A modern business theme

Alessandro Alpi's Blog

DevOps could be the disease you die with, but don’t die of.

Paul te Braak

Business Intelligence Blog

Sql Insane Asylum (A Blog by Pat Wright)

Information about SQL (PostgreSQL & SQL Server) from the Asylum.

Gareth's Blog

A blog about Life, SQL & Everything ...

SQLPam's Blog

Life changes fast and this is where I occasionally take time to ponder what I have learned and experienced. A lot of focus will be on SQL and the SQL community – but life varies.

William Durkin

William Durkin a blog on SQL Server, Replication, Performance Tuning and whatever else.

$hell Your Experience !!!

As aventuras de um DBA usando o Poder do $hell

Design a site like this with WordPress.com
Get started