Weird Data

random data analysis and visualization :)

Home About
avatar

Spurious Correlations - Indian Premier League Edition

By Rohit Suratekar on 13 December 2020

When we are analysing big data, many times we encounter weird correlations. There is a scientific name for it - Spurious Relationship or Spurious Correlation 1. You may have heard about the book by mathematician Tyler Vigen which talks about this phenomenon. It is no surprise that I often encounter...

avatar

Visualizations Collection 2020

By Rohit Suratekar on 7 December 2020

For most of the people, the year 2020 was a disaster in many aspects, but it turned out pretty good for my data analysis streaks. I did a lot of data analysis on many data-sets ranging from Running Records to World's Tea Export. Some received positive response while on others...

avatar

Creating an artificial screenplay with neural networks

By Rohit Suratekar on 16 October 2020

Natural Language Processing (NLP) is one of the highly researched topics in the field of Artificial Intelligence (AI). It is widely used in different contexts like language translation, personal assistants (like Siri), spam detection, document tagging, speech recognition etc. Researchers are developing new methods and tools every day related to...

avatar

Yet Another Movie Recommendation System

By Rohit Suratekar on 1 October 2020

In this age of the internet, you must have encountered one or the other form of Machine Learning (ML). It can be a simple captcha, language translation or all the advertisements that you see on various websites. ML has become a part of our daily lives. This blog post is...

avatar

Matplotlib vs SecretColors

By Rohit Suratekar on 17 June 2019

This post compares the color palettes of matplotlib and SecretColor. I recommend you to read basic concept behind this library first if you are planning to use this library in your workflow. However, if you are here to just enjoy visualization, then go ahead :) Note that this library is...

avatar

SecretColors- A Comprehensive Tutorial

By Rohit Suratekar on 11 June 2019

In my previous post, I introduced readers to my new python library called SecretColors to get excellent colors to your python workflow. Essentially this library has collections of various different color palettes and very easy API to access them without searching for hex codes or RGB values. This blog post...

avatar

Academic Publications in India

By Rohit Suratekar on 12 March 2019

A few days ago, India celebrated its National Science Day which marks the scientific achievement of Sir C. V. Raman and also inspires many Indians to pursue science. In 1996, government budget for research and development was 0.65% of GDP while same was decreased to 0.65% of GDP in 2015...

avatar

Grant Proposal, Science Book and Fiction

By Rohit Suratekar on 15 September 2018

I recently saw a news that one of the biggest scientific grant agency, Wellcome Trust, is sharing details of eligible applications for Open Research Fund (ORF). As I am in process of various postdoctoral fellowship applications, I got curious to know how other researchers write their proposal. I read few...

avatar

SecretColors - Make Plots Great Again !

By Rohit Suratekar on 10 September 2018

Update: New and updated version of this blog is now available here. I am obsessed with good colors. I particularly pay attention to all the colors I have used in my illustrations and graphics. This is extended to my data visualization also. There is huge amount of literature on how...

avatar

Indian Railways part III - geographical distribution

By Rohit Suratekar on 25 August 2018

In my previous post I have described how the Indian Railway database is rich in information and how we can visualize this data in the better way. This post is a continuation of that analysis. Here, we will explore how this data can be visualized by using a real map....

avatar

Indian Railways part II - Old versus New

By Rohit Suratekar on 22 August 2018

This post is an outcome of short analysis I discussed in my previous post. While checking the distribution of trains across different railway stations, I came into a problem of differentiating regular long distance railways versus local and suburban trains. At that time I was not aware of any database...

avatar

Indian Railways part I - travel to the Moon

By Rohit Suratekar on 25 July 2018

Indian Railways is the largest railway network by passenger transport 1 while 4th largest by total track route 2. This system is a backbone of long-distance Indian transport providing affordable commute to a commoner. Fig 1 shows an overview of the complicated network of railway tracks spanning all over India....

avatar

How far can you reach by walking?

By Rohit Suratekar on 15 July 2018

I recently ‘re-watched’ movie Forrest Gump. Tom Hanks’ acting was magical. There is scene in the movie where he starts running whole day and roams around the city. It got me thinking about such random running. If I were to do something like that what different areas can I visit...

avatar

Distribution of banks across India

By Rohit Suratekar on 7 July 2018

I was developing one android app for storing bank account details securely. One of the fuction I was thinking of adding was to search bank IFSC code. For this purpose I needed database of all Indian banks who have valid IFSC code. This code is imporant for performing NEFT or...