The following post discusses the method of ‘percentage correct’ predictions and explains why it may not be the most precise method to measure performance. I also examine the topic of...
Read full article →In this article, I continue exploring Logging as a data set. I have described this type of datasets earlier in Log Management and Big Data Analytics post. In this section,...
Read full article →In the following article, I explore the issue of log collection and analysis, a very specific problem domain for many large organizations. The logging is a suitable example of a...
Read full article →This is a short guide on how to install Hadoop single node cluster on a Windows computer without Cygwin. The intention behind this little test, is to have a test...
Read full article →The following article analyses the applicability of the CAP theorem to Big Data. I will explain the CAP theorem, explore the three of its characteristics, as well as provide the...
Read full article →Microtargeting (also micro-targeting or micro-niche targeting) is one of the methods that is used by the marketing sector to analyze consumer data collected from various sources to detect interests of...
Read full article →The following article is my attempt at exploring a niche market of Smart IoT Door Locking Solutions and partially investigate how Big Data analytics could improve this specific sector and...
Read full article →The following use case is my attempt at denoting the importance of Big Data in reference to the world’s largest food companies and their current impact on the overall trend...
Read full article →Volume is the most characteristic property of Big Data, which to a large extent affects the other five V’s of Big Data, namely the Velocity, Variety, Veracity, Variability, and Value....
Read full article →This short post talks about the PID services and how they are used in all components of Big Data Architecture Framework. Persistent Identifier (PID) services A critical function that spreads...
Read full article →