Dylan's BI Study Notes

My notes about Business Intelligence, Data Warehousing, OLAP, and Master Data Management

Archive for the ‘Data Science’ Category

Semantic Layer for Data Scientist

Posted by Dylan Wan on June 16, 2022

I recently read a good buyer’s guide from AtScale: The Buyer’s Guide to the Best Semantic Layer Tools for Data and Analytics. I think the guide is fair, not because the company I worked for also has such a semantic layer, but because I really feel that a vendor-specific semantic layer brings unnecessary vendor lock-in and a design that is not customer-oriented.


Posted in BI, Data Science

Schema On Read?

Posted by Dylan Wan on September 24, 2017

I first saw “create external table” in Oracle Database 11g.

There, the feature was meant for loading data.

When Hive was introduced, a lot of data had already been stored in HDFS.

Hive was introduced to provide a SQL interface over this data.

Using the external table concept is a natural design choice. It is not really a novel idea.
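To make the idea concrete, here is a minimal sketch in plain Python rather than Hive or Oracle syntax. It assumes a small comma-separated file that already exists before any table is defined; the `SCHEMA` list, the column names, and the `read_with_schema` helper are all hypothetical names made up for illustration. The point is only that the schema is declared separately and applied when the data is read, not when it is written.

```python
import csv
import os
import tempfile

# Hypothetical schema: column names and types declared apart from the data,
# much like the column list in an external table definition.
SCHEMA = [("user_id", int), ("page", str), ("seconds", float)]


def read_with_schema(path, schema):
    """Apply the schema while reading; the file itself is never rewritten."""
    rows = []
    with open(path, newline="") as f:
        for record in csv.reader(f):
            # Pair each raw string value with its declared name and type.
            rows.append(
                {name: cast(value) for (name, cast), value in zip(schema, record)}
            )
    return rows


def demo():
    # Simulate a file that was written to storage before any schema existed.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".csv", delete=False, newline=""
    ) as f:
        f.write("1,/home,2.5\n2,/cart,10\n")
        path = f.name
    try:
        return read_with_schema(path, SCHEMA)
    finally:
        os.remove(path)


if __name__ == "__main__":
    for row in demo():
        print(row)
```

Changing `SCHEMA` changes how the same bytes are interpreted, without touching the file. That is roughly what dropping and recreating an external table definition does.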


Posted in Big Data, Data Lake, Data Science, Oracle