Category: Data Analysis

Free WEKA machine learning algorithms for data mining tool

In exploring the data analytics tools (Knime, Rapid Miner, FME, Orange...) there have been references to WEKA. Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression,

Regular Expressions/Regex for data cleaning

A regular expression, regex or regexp is, in theoretical computer science and formal language theory, a sequence of characters that defines a search pattern. Usually this pattern is then used by string searching algorithms for “find” or “find and replace” operations on strings, or for input validation. From Wikipedia.
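As a rough illustration of the three uses mentioned above (find, find and replace, and input validation), here is a minimal Python sketch using the standard `re` module. The date pattern and the function names are assumptions made up for this example, not taken from the original post.

```python
import re

# Hypothetical pattern for illustration: an ISO-style date such as 2019-03-07.
DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def find_dates(text):
    """'Find': return every substring that matches the date pattern."""
    return DATE_PATTERN.findall(text)


def collapse_whitespace(text):
    """'Find and replace': turn each run of whitespace into a single space."""
    return re.sub(r"\s+", " ", text).strip()


def is_date(value):
    """Input validation: True only if the whole string matches the pattern."""
    return DATE_PATTERN.fullmatch(value) is not None
```

Note that this pattern only checks the shape of the value (four digits, two digits, two digits); it would still accept an impossible date such as 2019-99-99, so a real cleaning step would likely need a further check.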

RapidMiner Studio free Data Science Tool

It provides a wealth of functionality to speed & optimize data exploration, blending & cleansing tasks – reducing the time spent importing and wrangling your data. RapidMiner provides an integrated environment for data preparation, machine learning, deep learning, text mining, and predictive analytics. RapidMiner Studio (for some information, see item 18 of the list). This programme keeps

Orange 3. Text Mining basic exploration

A few words of jargon in the Text Mining area. Corpus. In linguistics, a corpus or text corpus is a large and structured set of texts. They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory. Token. Tokenization is the process of demarcating and

Orange free Data Mining Tool

I was looking through the 101-useful-websites article and came across AlternativeTo.net, and used it to look up alternatives to, say, “Revit” and “AutoCAD” and other tools I use. I then typed in KNIME, which I use for data mining and data analysis, and it came up with Orange as a free alternative. So I looked at

DataTables for dynamic database queries for tables on web pages & export tools

I have been interested in displaying tabulated data from a database on a web page, and in exploring what is out there to use. WordPress has a couple of free table add-ins, but they are usually only available for uploading static data to your website (or from Excel/CSV). There is one WordPress add-in that read from a

Accuracy of Data from Physical Survey versus BIM Extraction

This survey uses Revit model measures to compare against SPM surveyed data of 26 identical apartments. The Survey: This is a comparison of Duncan Terrace Flats A01 to A26, which are all similar (some are handed, so bathrooms/kitchens are flipped; see plans below). Each individual flat was surveyed by one of 5 surveyors who had

Revit data extraction for Asset Management Information System (AMIS)

Capturing information on existing assets is a challenge. Wellington City Council (WCC) has over 2200 social housing units, so getting good information on them in a consistent manner was a challenge. They undertook a condition survey of the properties with surveyors. It took the surveyors on average 4 hours to survey each unit. There