Commit graph

250 commits

Author SHA1 Message Date
Joshua Eckroth
dcf6534ea4 Removed UI code (Weka can do that better) and some other unused or broken dependencies. 2013-08-07 10:32:24 -04:00
Joshua Eckroth
65a851341b Added regression example to readme and new function for regression, classifier-predict-numeric. 2013-08-07 10:24:45 -04:00
Joshua Eckroth
ddace20320 Added :replace-missing-values filter and updated readme. 2013-08-06 21:37:17 -04:00
Joshua Eckroth
a18cbbae19 Added example of using Titanic survival data from Kaggle
https://www.kaggle.com/c/titanic-gettingStarted
2013-08-06 19:44:23 -04:00
Joshua Eckroth
c650f86c3a Bugfix for (classifier-label) 2013-08-06 19:43:46 -04:00
Joshua Eckroth
73953ef2fb Removed useless files. 2013-08-06 12:19:34 -04:00
Joshua Eckroth
c44917b0fc Fixed bulleted lists in readme. 2013-08-06 12:18:17 -04:00
Joshua Eckroth
5d7faa2b22 Grammer in readme. 2013-08-06 12:16:03 -04:00
Joshua Eckroth
914b65a5dc Added clusterer-cluster examples to readme. 2013-08-06 12:15:58 -04:00
Joshua Eckroth
81cc54f8a3 Fixed author links in readme. 2013-08-06 12:15:38 -04:00
Joshua Eckroth
8e15fcc3ee Updated authors in readme. 2013-08-06 12:08:41 -04:00
Joshua Eckroth
10310d74e8 Simpler usage for docs-to-dataset. 2013-08-06 03:42:20 -04:00
Joshua Eckroth
59f4cf3697 Improved some dataset functions that operate on the class attribute. 2013-08-06 03:42:03 -04:00
Joshua Eckroth
3064722b14 Formatting. 2013-08-06 03:41:21 -04:00
Joshua Eckroth
8aeed64130 Changed classifier-classify to produce the class label; updated classifier-label to avoid using classifier-classify. 2013-08-06 03:41:12 -04:00
Joshua Eckroth
53d141019f Added codox metadata to project.clj 2013-08-06 03:40:20 -04:00
Joshua Eckroth
7a450f2e04 Updated tutorial in readme. 2013-08-06 03:40:05 -04:00
Joshua Eckroth
7a90091fba Removed some debugging statements. 2013-08-04 08:58:57 -04:00
Joshua Eckroth
19d3772bc0 Wrapped some noisy operations in (capture-out-err) which captures and discards stdout and stderr. 2013-08-04 08:57:49 -04:00
Joshua Eckroth
db70ee980f Fixed indentation. 2013-07-31 06:50:59 -04:00
Joshua Eckroth
2945f082bb Use default 0 for :normalize param (rather than false) 2013-07-17 00:43:05 -04:00
Joshua Eckroth
b5f92c5ced Support :normalize option in docs-to-dataset 2013-07-17 00:41:02 -04:00
Joshua Eckroth
8d7c41c25b Typo. 2013-07-16 23:59:10 -04:00
Joshua Eckroth
0e9b0bdb14 Add :counts option for docs-to-dataset 2013-07-16 23:58:14 -04:00
Joshua Eckroth
1105dac7b8 Don't limit the size of fulltext in docs-to-dataset 2013-07-16 23:51:12 -04:00
Joshua Eckroth
3ead98c527 Added k-nearest neighbor classifier (:lazy :ibk) 2013-07-16 23:29:45 -04:00
Joshua Eckroth
26a9d69c05 Fixed saving/loading csv instances. 2013-07-11 07:40:32 -04:00
Joshua Eckroth
0da42ca0ea Filter out junk from text fields. 2013-07-11 00:24:24 -04:00
Joshua Eckroth
8b53ee681c Shuffle docs (with/without term) when making a dataset. 2013-07-04 15:32:07 -04:00
Joshua Eckroth
b8cc877c05 Bugfix and added empty test case to force loading/compiling of clj-ml.public-datasets. 2013-07-04 09:42:03 -04:00
Joshua Eckroth
3dd4d872cd Switching to 0.4.0-SNAPSHOT to support rapid changes. 2013-07-04 09:41:36 -04:00
Joshua Eckroth
7b71a3d0f0 Bumped to version 0.3.13. 2013-07-04 09:38:52 -04:00
Joshua Eckroth
cce83d924d Bugfix. 2013-07-04 09:36:31 -04:00
Joshua Eckroth
623e9a1ef3 New format for terms on public datasets. Bumped to version 0.3.12. 2013-07-04 09:34:53 -04:00
Joshua Eckroth
4a4ef5ea03 Starting support for public datasets. Added reuters21578 dataset handling.
Bumped to version 0.3.11.
2013-07-04 09:00:48 -04:00
Joshua Eckroth
6f03716d0a Added function clj-ml.data/docs-to-dataset to support translating text documents (with title, fulltext, and terms) into wordvec datasets for binary classification.
Bumped to version 0.3.10.
2013-07-04 08:53:45 -04:00
Joshua Eckroth
123cd1713c Switched to a modern Snowball stemmer implementation. Bumped version to 0.3.9. 2013-07-01 14:40:05 -04:00
Joshua Eckroth
edceff891b Fixed bug reading instance attribute values.
Bumped to 0.3.8.
2013-07-01 14:08:01 -04:00
Joshua Eckroth
a8ccca6d53 Support stemmers (use Porter/English/etc. rather than lowercase form).
Bumped to version 0.3.7.
2013-07-01 14:07:34 -04:00
Joshua Eckroth
2ce2d41885 Capture log messages from libsvm (which is noisy). 2013-06-21 01:20:40 -04:00
Joshua Eckroth
4aa08595a1 More fixes for svm grid operation; added libsvm to project deps. 2013-06-21 01:00:55 -04:00
Joshua Eckroth
52a1de6732 Bugfixes for new svm grid classifier. 2013-06-21 00:32:46 -04:00
Joshua Eckroth
6afa040268 Updated version. 2013-06-21 00:25:35 -04:00
Joshua Eckroth
b362e31f6b Merge branch 'master' of github.com:joshuaeckroth/clj-ml
# Please enter a commit message to explain why this merge is necessary,
# especially if it merges an updated upstream into a topic branch.
#
# Lines starting with '#' will be ignored, and an empty message aborts
# the commit.
2013-06-21 00:24:16 -04:00
Joshua Eckroth
f9704e67a4 Added libsvm-grid classifier type, which is really a collection of classifiers. 2013-06-21 00:23:24 -04:00
Joshua Eckroth
76fb70c602 Fixed broken test case. 2013-06-21 00:22:48 -04:00
Joshua Eckroth
e031060f01 Merge pull request #2 from shark8me/patch-2
Update README.md
2013-05-27 15:37:59 -07:00
Joshua Eckroth
78983b52cd Merge pull request #1 from shark8me/patch-1
Update README.md
2013-05-27 15:36:49 -07:00
Kiran Karkera
fff2957979 Update README.md
Line 114-116
Changed the loading of filtered-ds, the '(' was in the wrong place.
2013-05-26 10:47:10 +05:30
Kiran Karkera
6b63bf1dcc Update README.md
removed an addition 's' from 
(def classifier (make-classifier :decission-tree :c45)) 
to
(def classifier (make-classifier :decision-tree :c45))
2013-05-26 10:09:50 +05:30