DSTK - Data Science ToolKits


DSTK - DataScience ToolKit is a free software for statistical analysis, data visualization, text analysis, and predictive analytics. It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. Under settings, you can specify the software path to use for advanced prediction modeling, data transformation/editing, and python IDE. Of course you may specify JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy.



License: R, RStudio, NLTK, SciPy, SKLearn, MatPlotLib, Weka, ... each has their own licenses.

FEATURES



Statistics

1. Descriptives (mean, median, variance, standard deviations, ...)
2. Inferential (T-Test, ANOVA, Wilcoxon, ...)
3. Regression (Linear, ...)
... And interface with R and Python, and GNumeric, ...

Data Visualizations

1. Histogram
2. Scatter Plots
3. Box Plots
... And more...

Predictive Analytics

1. REPTree
2. MLP
... And more with WEKA Algorithms...

Text Mining and Analysis

1. Text Preprocessing (stopwords, porter stemmer, regular expressions, ..., semi auto preprocessing)
2. POS Tagging, Name Entity, Word Net
3. Sentiment Analysis
4. Text Classification (Naive Bayes, NN, ...)
... And more with gazetteers from GATE...

Plugins

1. Expand features with R Scripts...
2. Included plugins for Big Data Analysis using Microsoft Azure...


Download DSTK - DataScience ToolKit

Screenshots




Other Softwares...



JAOSS - Online Statistical System


JAOSS - Just Another Online Statistical System. A simple statistical system with fairly sophisticated features such as descriptive statistics analysis, inferential statistics with ANOVA and T-Test, Predictive Analytics with Neural Network. The application is written in R and Shiny, with an aim in mind to provide online access to simplistic statistical system before proceed to advanced softwares such as SPSS or DSTK. This product is currently available as FREEware.

View details »

Demo »

JATAS - Online Text Analysis


JATAS - Just Another Text Analysis System. A simple text analytics system with fairly sophisticated features such as text preprocessing (stemmer, stopwords...), Visualizations, and Predictive Analytics with SVM. The application is written in R and Shiny, with an aim in mind to provide online access to simplistic text analysis system before proceed to advanced softwares such as SPSS Modeler or DSTK. This product is currently available as FREEware..

View details »

Demo »

Plagiarism Checker


A simple plagiarism checker that allows writers or educators to check for plagiarism of in a simple text, using Google and Bing search engines. You have the option to check by sentences or based on number of words. This product is currently available as FREEware. Enjoy.

View details »

About Us


DSTK Tech is part of SVBook. Our main goal is to create useful data science tools for practitioners in both academia and business to reach fast conclusions before going into deeper tools like SPSS Statistics. DSTK was designed with the user in mind, using SPSS and Excel like interface to reduce the learning curves. DSTK is free of charge and have been uploaded to Sourceforge.net.

Contact

Question?

Singapore