I got featured in a Wired article recently- this is one of my very old ideas- basically an App store for Algorithms.
I briefly advised two startups in this space (but no longer do)
- this was the original article in ReadWriteWeb http://readwrite.com/2011/06/01/an-app-store-for-algorithms
Klint Finley took time to shoot me some questions and you can read the final article here.
I have now been featured in Wired , ReadWrite Web and a member of Star Trek has reblogged me on Tumblr! Geek heaven and I owe it all to the readers of Decisionstats.com!
While courts, politicians, activists , spies and even corporate leaders have spoken or ducked on the question of whole sale data collection by NSA, one group that is both in the thick of action as well as conspicuous by it’s silence is the data scientist community.
While one prominent open source member of R community spoke out against analyzing the data leaked by Wikileaks ( an admirable stand given his background) no one seems to be perturbed to be speaking on analyzing data belonging to fellow citizens and the world at the same time. ( see-WHY I WILL NOT ANALYZE THE NEW WIKILEAKS DATA)
Now Data Scientists and the Intel Communities have long worked together and one of the SAS Institute’s solid cash cows remains its strangehold on intel analytics ;) http://www.sas.com/resources/brochure/government-intelligence-community-overview-brochure.pdf) , what is perplexing is the deafening silence regarding the violation of Fourth Amendment rights of American citizens domestically and abroad (see http://en.wikipedia.org/wiki/Fourth_Amendment_to_the_United_States_Constitution) and the active collusion by primarily data scientists in this
The right of the people to be secure in their persons, houses, papers, and effects,[a] against unreasonable searches and seizures, shall not be violated, and no Warrants shall issue, but upon probable cause, supported by Oath or affirmation, and particularly describing the place to be searched, and the persons or things to be seized
Is not your email , your social media, your mobile data a part of your person/house/paper and effect. Does Atlas need to shrug and do data scientists need to say enough is enough to stop this blatant misuse?
No- because the well funded NSA and DOD budgets will always be more than the conscience of a few data scientists. There will always be hackers for hire, and the people shall be led by a sheep.
The Numerati- or the numerically enabled technological elite data scientists are as culpable as the agencies using them. This can be addressed by lawsuits against compliant statisticians and data miners as well as they are the ones enabling violation of fourth amendment rights. Countries like India have chosen to feed off this data trough and countries like China have chosen to create their own walled off internet instead. It is American data scientists alone who can help guide their Congress to Sanity. The timing is pertinent as Congress debates amending the Foreign Intelligence Surveillance Act
” the proposed changes would not touch the agency’s abilities overseas, which are authorized by Executive Order 12333, a Reagan-era presidential directive. The administration has declassified some rules for handling Americans’ messages gathered under the order, but the scope of that collection and other details about how the messages are used has remained unclear.”
( see – http://www.nytimes.com/2014/08/14/us/politics/reagan-era-order-on-surveillance-violates-rights-says-departing-aide.html , https://www.documentcloud.org/documents/836235-ussid-sp0018.html and http://www.nytimes.com/2014/07/25/us/politics/senators-bill-is-stricter-on-nsa-than-houses.html)
Are you a data scientist who wants to help out? Help the ACLU educate Congress ( https://www.aclu.org/national-security/fix-fisa-end-warrantless-wiretapping )on the proper way to dispose off private data, and anonymize the data already collected .
Otherwise the next generations will be born in an age where every move recorded by CC cameras or wearable computing devices will be mined by corporations for ads and governments for threats.
The last word on this was said by a wise old White Man ( in an age where Wise Old White Men are no longer fashionable or even correct)
The tree of liberty must be refreshed from time to time with the blood of patriots and tyrants–Thomas Jefferson
Maybe he was referring to this tree
If the NSA can be sued for collection of data, why cant Google be sued for sharing my data with NSA without my permission.
Any thoughts- anyone who knows Tort law here?
What did the terms and conditions of google’s policy say back then in those good old days of quiet cooperation
What about global liability across different countries (like EU and India)
_ I think there should be a lawsuit to discover more (click the link)
The truth is out there!
The following certificate is awarded to Chandan Routray (https://www.linkedin.com/in/chandanroutray) , a 2yr student at an IIT who learnt all this
and wrote some of this at https://python4analytics.wordpress.com/- the following certificate to show he was tested as a potential scientist and showed great promise in executing his task!
This is an interview wit Heiko Miertzsch, founder EODA ( http://www.eoda.de/en/). EODA is a cutting edge startup . recently they launched a few innovative products that made me sit up and pay attention. In this interview, Heiko Miertzsch , the founder of eoda talks on the startup journey and the vision of analytics.
DecisionStats (DS)- Describe the journey of your startup eoda. What made you choose R as the platform for your software and training. Name a few turning points and milestones in your corporate journey
Heiko Miertzsch (HM)- eoda was founded in 2010 by Oliver and me. We both have a strong background in analytics and Information Technology industry. So we observed the market a while before starting the business. We saw two trends: First, a lot of new Technologies and Tools for data analysis appeared and second Open Source seemed to become more and more important for several reasons. Just to name one the easiness to share experience and code in a broad and professional community. Disruptive forces seem to change the market and we just don’t want back the wrong horse.
From the beginning on we tested R and we were enthusiastic. We started choosing R for our projects, software development, services and build up a training program for R. We already believed in 2010 that R has a successful future. It was more flexible than other statistic languages, more powerful in respect of the functionality, you could integrate it in an existing environment and much more.
DS- You make both Software products and services. What are the challenges in marketing both?
HM- We even do more: We provide consulting, training, individual software, customizing software and services. It is pure fun for us to go to our customers and say “hey, we can help you solving your analytical problems, no matter what kind of service you want to buy, what kind of infrastructure you use, if you want to learn about forest trees or buy a SaaS solution to predict your customers revenues”. In a certain way we don’t see barriers between these delivery models because we use analytics as our basis. First of all, we focus on the analytical problem of our customers and then we find the ideal solution together with the customer.
DS- Describe your software tableR. How does it work, what is the pricing and what is the benefit to user. Name a few real life examples if available for usage.
HM- Today the process of data collection, analysis and presenting the results is characterized by the use of a heterogeneous software environment with many tools, file formats and manual processing steps. tableR supports the entire process from design a questionnaire, share a structured format with the CAXI software, import the data and doing the analysis and plot the table report with only one single solution. The base report comes with just one click and if you want to go more into detail you can enhance your analysis with your own R code.
tableR is used in a closed beta at the moment and the open beta will in start next weeks.
(It is available at http://www.eoda.de/en/tableR.html)
DS- Describe your software translateR (http://www.eoda.de/en/translateR.html) . How does it work, what is the pricing and what is the benefit to user. Name a few real life examples if available for usage.
HM- Many companies realized the advantages of the open source programming language R. translateR allows a fast and inexpensive migration to R – currently from SPSS code.
The manual migration of complex SPSS® scripts has always been tedious and error-prone. translateR will help here and the task of translating by hand becomes a thing of the past. The beta test of translateR will also start in the next weeks.
DS- How do you think we can use R on the cloud for better analytics?
HM- Well, R seems to bring together the best “Data Scientists” of the world with all their different focuses on different methods, vertical knowledge, technical experience and more. The cloud is a great workplace: It holds the data – a lot of data and it offers a technical platform with computing power. If it succeeds to bring these two aspects together, we could provide a lot of knowledge to solve a lot of problems – with individual and global impact.
DS- What advantages and disadvantages does working on the cloud give to a R user?
HM- In terms of R I don’t see other aspects than in using the cloud in general.
DS- Startup life can be hectic – what do you do to relax.
HM- Oliver and I have both families, so eoda is our time to relax – just fun. I guess we do the same typical things like others, Oliver plays soccer and goes running. I like any kind of endurance sports and go climbing, the first to give the thoughtless space the second to train to focus on a concrete target.
translateR is the new service from German based R specialist eoda, which helps users to translate SPSS® Code to R automatically. translateR is developed in cooperation with the University of Kassel and financially supported by the LOEWE-program of the state Hessen. translateR will be available as a cloud service and as a desktop application.
eoda offers consulting, software development and training for analytical and statistical questions. eoda is focused on R and specializes in integrating R into existing software environments.
The internet is a ponderful and wonderful place for serendipity