Category Archives: Internet

Ajay Ohri in Wired – An App Store for Algorithms

I was featured in a Wired article recently. It covers one of my very old ideas: basically an app store for algorithms.

I briefly advised two startups in this space (but no longer do).

Klint Finley took time to shoot me some questions and you can read the final article here.


I have now been featured in Wired and ReadWrite Web, and a member of Star Trek has reblogged me on Tumblr! Geek heaven, and I owe it all to the readers of Decisionstats.com!


The Silence of the Data Science Lambs

While courts, politicians, activists, spies and even corporate leaders have spoken or ducked on the question of wholesale data collection by the NSA, one group that is both in the thick of the action and conspicuous by its silence is the data scientist community.


While one prominent open source member of the R community spoke out against analyzing the data leaked by Wikileaks (an admirable stand given his background), no one seems perturbed enough to speak out about analyzing data belonging to fellow citizens and the rest of the world. (See: WHY I WILL NOT ANALYZE THE NEW WIKILEAKS DATA)

Data scientists and the intel communities have long worked together, and one of the SAS Institute’s solid cash cows remains its stranglehold on intel analytics ;) (http://www.sas.com/resources/brochure/government-intelligence-community-overview-brochure.pdf). What is perplexing is the deafening silence regarding the violation of the Fourth Amendment rights of American citizens domestically and abroad (see http://en.wikipedia.org/wiki/Fourth_Amendment_to_the_United_States_Constitution) and the active collusion in this, primarily by data scientists.

The right of the people to be secure in their persons, houses, papers, and effects, against unreasonable searches and seizures, shall not be violated, and no Warrants shall issue, but upon probable cause, supported by Oath or affirmation, and particularly describing the place to be searched, and the persons or things to be seized.

Are not your email, your social media and your mobile data part of your person, house, papers and effects? Does Atlas need to shrug, and do data scientists need to say enough is enough to stop this blatant misuse?

No, because the well-funded NSA and DOD budgets will always outweigh the conscience of a few data scientists. There will always be hackers for hire, and the people shall be led by a sheep.

The Numerati, or the numerically enabled technological elite of data scientists, are as culpable as the agencies using them. This can be addressed by lawsuits against compliant statisticians and data miners as well, since they are the ones enabling the violation of Fourth Amendment rights. Countries like India have chosen to feed off this data trough, and countries like China have chosen to create their own walled-off internet instead. It is American data scientists alone who can help guide their Congress to sanity. The timing is pertinent, as Congress debates amending the Foreign Intelligence Surveillance Act.

“the proposed changes would not touch the agency’s abilities overseas, which are authorized by Executive Order 12333, a Reagan-era presidential directive. The administration has declassified some rules for handling Americans’ messages gathered under the order, but the scope of that collection and other details about how the messages are used has remained unclear.”


(See http://www.nytimes.com/2014/08/14/us/politics/reagan-era-order-on-surveillance-violates-rights-says-departing-aide.html, https://www.documentcloud.org/documents/836235-ussid-sp0018.html and http://www.nytimes.com/2014/07/25/us/politics/senators-bill-is-stricter-on-nsa-than-houses.html)

Are you a data scientist who wants to help out? Help the ACLU educate Congress (https://www.aclu.org/national-security/fix-fisa-end-warrantless-wiretapping) on the proper way to dispose of private data and to anonymize the data already collected.


Otherwise the next generations will be born in an age where every move recorded by CC cameras or wearable computing devices will be mined by corporations for ads and governments for threats.

The last word on this was said by a wise old White Man (in an age where Wise Old White Men are no longer fashionable or even correct):

The tree of liberty must be refreshed from time to time with the blood of patriots and tyrants–Thomas Jefferson

Maybe he was referring to this tree




The first Decisionstats.com Intern

The following certificate is awarded to Chandan Routray (https://www.linkedin.com/in/chandanroutray), a second-year student at an IIT who learnt all this

and wrote some of this at https://python4analytics.wordpress.com/. The certificate shows that he was tested as a potential scientist and showed great promise in executing his task!



Big Data Shoes

The internet is a ponderful and wonderful place for serendipity


Using Windows Azure Machine Learning as a service with R #rstats

A Brief Tutorial I wrote by playing with the software at manage.windowsazure.com

Happy July 4th

To all my American friends.


Great Way to learn Git easily

A great way to learn Git easily is here: https://try.github.io/

This is a much better designed Code School project than the one for R.


However, Swirl is a great way to learn R interactively. Its only drawback is that it needs to be integrated with something like http://www.r-fiddle.org/#/ for a truly automated, browser-only version.

Why do I favor automated e-learning solutions now? Because teaching the same thing again and again can be boring for the teacher, and videos can be boring for the students. Note how the potential student is given positive reinforcement to boost his morale, something any good teacher knows to do.

