Clean, clear, and connected: Explaining Data Deduplication and Entity Resolution

In today’s data-driven business landscape, effective Master Data Management is key.  The amount of data enterprises generate and collect is growing at unprecedented speed.  Everyone agrees that data can deliver the edge a business needs to be successful.  However, the requirement to be data-driven has pushed many businesses to frantically acquire as much data as …

AI in eCommerce: What is it and how does it work?

Here you are, reading this article – most likely because some AI-powered algorithm made it available for you based on your search and other information it might have on you. I know this sounds a bit scary. And the truth is that, for most people, Artificial Intelligence (AI) and its closely connected sub-term machine learning …

Personalization Made Easy: A How-To Approach with Customer, Transaction, and Product Data

Personalization is a critical aspect of any successful retail or e-commerce strategy. By tailoring your messages, offers, and product recommendations to each customer’s unique interests and preferences, you can create a more engaging and meaningful experience that drives loyalty and increases sales. However, personalization is only effective if you have the right data. In this …

Designing a smarter search box for a website

Research by renowned analysts is also clear – as many as 43% of web shop visitors first go to “search”, and as many as 39% of all online customers admit that “search” significantly influenced their purchase. Even if your website is organized perfectly, customers will still get confused and some simply prefer search mechanics.  Yet, …

Our strategy for building products

When we started building our product, we focused on the Recommender engine and Segmentation. As a startup, we embraced agility and focus on the market. At the beginning of 2020, we decided that our product must become a reflection of our team. Then we said it must be a “no bullshit” product, agile or, in …

How to combine Visual Search and IM

In this post we briefly explain how to integrate IM applications with Visual Search. You can read more about Visual Search implementation in our last blog post. Instant Messaging (IM) and Chatbots: Among its other benefits, the Internet has transformed and simplified how people communicate with each other. In addition to email, IM has …

A closer view on Data Science Delivery

According to Gartner, only 15% to 20% of data science projects get completed. Of the projects that are completed, CEOs say that only about 8% generate value. Despite these facts, data science is still considered an opportunity for business growth. These facts are something that always lingers in the back of everyone’s …

Be one step ahead: Solver AI Suite short overview

Intro and motivation: It all started as three separate projects. The first, known as MoMa, was for managing machine learning models; the second, Fibi, was for forecasting using multiple models to give the best results; and the third was a business solution for segmentation and a recommender system with personalized view and …

Introduction to recommender systems

After watching the Udemy online course Building Recommender Systems with Machine Learning and AI, I came up with the idea to write a text that can help beginners understand the basic ideas of recommender systems. A recommender system, or recommendation system, is a subclass of information filtering system that seeks to predict the …

How to model better recommendations in Covid time?

Needs in the IT sector are constantly changing. Since the coronavirus hit us unexpectedly, this is truer than ever. In order to keep our distance from each other, we were forced to limit all activities that can’t be done online. At first, this was shocking and we weren’t prepared for such a change. But when we …

HOW TO START WITH DATA SCIENCE?

After I participated in a meetup at the end of March, titled “Data Science – what is it?”, a lot of people asked me to send them some introductory materials to help them get started with learning. It took me a long time to sit down and start compiling a list, because there are many sources, …

Dash by Plotly

Let’s say you have been working on a client segmentation project. You have your client segments well separated and your final task is to present findings and results to the project stakeholders. Usually, none of them has the technical expertise to understand your code, so you need to visualize …

Hello Docker

Having spent a couple of weeks on data preparation and developing that particular machine learning model, you are finally ready to show off some really good results to your boss. You have your notebooks with lines of code doing magic, maybe some reports in Excel, amazing visualizations in Plotly, etc. It’s 5 minutes till your presentation …

Anomaly detection

Anomaly detection is a very challenging problem often faced in data analysis. Whether the task is clustering, classification, or some other machine learning problem, it is of great importance to identify anomalies and handle them in some way in order to achieve optimal model performance. Furthermore, anomalies could often influence the analysis …

Handling missing data

Hi, everyone. Although I planned for my next post to be about anomaly detection and treatment, I ran into another type of problem that quickly escalated into a huge issue affecting the modelling and the accuracy of the results, and I couldn’t resist sharing my experience as soon as possible. In this post, I will be talking about …

Forecasting with VAR and Prophet

In my previous post, I tried to present the ARIMA model for forecasting. It is based on the concepts of autoregression and moving average: respectively, the regression of a variable on its own lagged values, and the modelling of the error as a linear combination of past error terms. In this post, …

Interactive log analysis with Apache Spark

The Internet is becoming the largest global shop across markets, and anyone offering products or services of any kind prefers web shops to become the primary outlets for supplying customers. This leads to a reduction in the number of employees and traditional brick-and-mortar branches and a reduction in costs, so it …

Forecasting with ARIMA

One of the most challenging machine learning problems is predicting some output based on the history of its previous values. The complexity of the problem multiplies as new features and constraints are added to the analysis. Thus, in time series analysis it is not always enough to use previous values only; there are often many features …
