Blog

Integration testing with Ranger

06/15/17
06/15/17

Integration testing with Ranger

By Milan Milosevic
06/08/17

The Curse of Simplification

A short, non-technical narrative on complexity in Data Science projects and how ...

By Goran S. Milovanovic, PhD
06/02/17

SmartCat Values: Start with "Why?"

  Why did we organize this meeting? Why do I have to write the minutes of meeti ...

By Bojan Kovac
05/26/17

Load testing Kafka with Ranger

The best way to test an infrastructure before going into production is to mimic ...

By Matija Gobec
05/18/17

Fast matrix factorization in R

This article will be a wrap-up of our series related to collaborative filtering ...

By Stefan Nikolic
04/27/17

SmartCat Values: Knowledge is power

When we decided to start our own company, back in 2015, the first thing we did a ...

By Bojan Kovac
04/07/17

Hybrid content-based and collaborative filtering recommendations with {ordinal} logistic regression (2): Recommendation as discrete choice

In this continuation of "Hybrid content-based and collaborative filtering recomm ...

By Goran S. Milovanovic, PhD
03/27/17

Hybrid content-based and collaborative filtering recommendations with {ordinal} logistic regression (1): Feature engineering

  I will use {ordinal} clm() (and other cool R packages such as {text2vec} as w ...

By Goran S. Milovanovic, PhD
03/16/17

Twitalyzr

You shall not pass! This pretty much sums up the main reason why Twitalyzr was m ...

By Stanko Kuveljic
03/10/17

Data Science Unicorns and Where to Find Them

Each project in this Big Data world is going through a carefully paved path. Fir ...

By Nenad Bozic
02/22/17

#AskNASA: What's the Optimal Time for Aliens to Invade Earth?

My inaugural blog as a Data Science Consultant for SmartCat. The code that accom ...

By Goran S. Milovanovic, PhD
02/21/17

Improved R implementation of collaborative filtering

Collaborative filtering (CF) is one of the most popular techniques for building ...

By Stefan Nikolic
02/01/17

Challenges of Monitoring Distributed Systems

Last October one of our co-founders and senior consultants Nenad Bozic held a pr ...

By Nenad Bozic
01/17/17

Where is my data - debugging SSTables in Cassandra

Apache Cassandra is great for handling huge volumes of data. Everything works re ...

By Nenad Bozic
01/09/17

MongoDB vs Couchbase - part two

This is the round two in comparing MongoDB vs Couchbase. In round one, we saw th ...

By Milan Milosevic
12/27/16

Cassandra Tuning - Above and Beyond

This September one of our co-founders and senior consultants Matija Gobec held a ...

By Matija Gobec
12/13/16

Tuning Java Driver for Heavy write and Low Latency Read Scenario

In the first two blog posts (part 1 and part 2) we gave a couple of pointers abo ...

By Nenad Bozic
11/29/16

Recommender Systems: Matrix operations for fast calculation of similarities

Recommender systems have become ubiquitous and very important in recent years. T ...

By Stefan Nikolic
11/16/16

Tuning DataStax Java Driver for Cassandra - Part 2

In first part of this blog post series we covered basic settings which can give ...

By Nenad Bozic
11/10/16

Tuning DataStax Java Driver for Cassandra - Part 1

When people think of tuning Apache Cassandra to perform better, their first inst ...

By Nenad Bozic
10/14/16

The Next Generation of OSS Software Won’t Be Apache

The Apache Software Foundation (ASF) has been a steward of free open source soft ...

By Scott Hirleman
10/03/16

Intro to Document-Oriented NoSQL Databases

This is the first post in the series about comparing MongoDB with Couchbase, whi ...

By Milan Milosevic
09/28/16

Systemd Or How I learned to stop worrying and love newness

Working in the IT world where things are not yet fully connected, integrated a ...

By Nikola Ivancevic
09/11/16

Metric Collection Stack for Distributed Systems

In our previous post we referred to the subject of having logs in a central plac ...

By Nenad Bozic
08/17/16

Slow Queries Monitoring

Working on high nines where the latency of every query matters is a whole differ ...

By Nenad Bozic
06/30/16

Distributed logging

Browsing through logs is always hard, even when you are on a single node system. ...

By Nenad Bozic
06/24/16

After a mile in your own shoes

Spoiler Alert: This article is not technology-focused. It’s people-focused. Ton ...

By Bojan Kovac
06/14/16

What's new in Apache Cassandra 3.0 - part 2

In the part one of “What’s new in Cassandra 3.0” I got into details about materi ...

By Matija Gobec
05/11/16

Monitoring stack for distributed systems

Microservice architecture on the one hand, and distributed systems on the other, ...

By Nikola Ivancevic
05/05/16

Craft conference 2016

This was our third Craft Conference, the place to be if you are connected to IT ...

By Nenad Bozic
03/21/16

Polyglot Persistence in NoSQL Space

Relational databases have been around for a long time, developers tend to use th ...

By Nenad Bozic
03/14/16

To walk a mile in client's shoes...

     It was late one Wednesday evening, 8:30 pm. We were pulling long hours befo ...

By Bojan Kovac
03/01/16

Introduction to Apache Kafka

In my previous blog I wrote about distributed systems and why we choose this pat ...

By Matija Gobec
02/25/16

How (not) to start with Apache Cassandra

Within several previous projects, we have held consultations for development tea ...

By Nenad Bozic
02/08/16

Functional testing of email communication

Functional testing series Blackbox testing microservices Graybox testing - C ...

By Nenad Bozic
12/21/15

What's new in Apache Cassandra 3.0 - part 1

In the world of a fast growing number of NoSql databases and fast, scalable and ...

By Matija Gobec
12/15/15

Go CD - Continuous delivery through pipelines

In order to compete in today’s IT market, you must be truly agile, you must list ...

By Nenad Bozic
12/06/15

Front-end first development?

Whenever a new product or new feature implementation is ahead of us, there are m ...

By Bojan Kovac
12/04/15

What it means to be a geek

I walked through the awakening downtown of Novi Sad. It was early. People with t ...

By Bojan Kovac
11/18/15

How to hire a good data scientist and avoid fake ones?

With the current “big data” hype there is a big demand for skilled and knowledge ...

By Milos Grubjesic
10/14/15

Graybox testing - Control your dependencies

Functional testing series Blackbox testing microservices Graybox testing - C ...

By Nenad Bozic
10/06/15

Bring functional tests closer to business with Cucumber

Functional testing series Blackbox testing microservices Graybox testing - C ...

By Nenad Bozic
10/02/15

Cassandra Summit 2015

As this was the biggest NoSQL event in the world and the biggest gathering of Ca ...

By Nenad Bozic
08/25/15

Spring batch as framework for system integration

We had finished up the first set of requirements for some project and obtained a ...

By Nenad Bozic
08/17/15

Spark + Cassandra: The perfect match

Hadoop has been the leading platform for distributed data storage and analytics ...

By Matija Gobec
08/07/15

Leveraging parallel execution

With NoSql databases comes change in physical data modelling. When it comes to t ...

By Matija Gobec
07/22/15

Setting up Embedded Cassandra on Spring project

When we first started using Cassandra, we immediately realized there would be a ...

By Nenad Bozic
07/20/15

Cassandra migration tool

Developing a product usually means that during the period of development you are ...

By Matija Gobec
07/16/15

Migrating time series data from MySql to Cassandra

MySql is still widely used in application development as a stable, fairly perfor ...

By Matija Gobec
07/07/15

Blackbox testing microservices

Functional testing series Blackbox testing microservices Graybox testing - C ...

By Nenad Bozic
06/27/15

Why go distributed

Why go distributed? When talking to other fellow engineers and people in our ind ...

By Matija Gobec
06/15/15

Cassandra complex queries - lessons learned

Just a couple of years ago, the decisions faced by software architects were quit ...

By Nenad Bozic
05/17/15

Why Big Data

We wanted to share why we want to do what we do and why we think this is the fut ...

By Nenad Bozic
05/04/15

Craft conference 2015

This was our second a craft conference, the place to be if you are connected to ...

By Nenad Bozic