Joshua Eckroth
c7c7cdd9f1
The option for the resample filter is :no-replacement not :replacement
2013-10-12 15:07:00 -04:00
Joshua Eckroth
9b56c2a53f
Use clj-ml.artifice.cc for test arff/csv files
2013-09-21 20:01:31 -04:00
Joshua Eckroth
5626d45654
Fixed tests on Windows
...
Line-ending problem.
2013-09-21 19:53:54 -04:00
Joshua Eckroth
937b9bf87d
Fixed some tests that broke when nominal attributes changed from string to keyword representations.
2013-08-07 10:32:50 -04:00
Joshua Eckroth
8aeed64130
Changed classifier-classify to produce the class label; updated classifier-label to avoid using classifier-classify.
2013-08-06 03:41:12 -04:00
Joshua Eckroth
db70ee980f
Fixed indentation.
2013-07-31 06:50:59 -04:00
Joshua Eckroth
3ead98c527
Added k-nearest neighbor classifier (:lazy :ibk)
2013-07-16 23:29:45 -04:00
Joshua Eckroth
26a9d69c05
Fixed saving/loading csv instances.
2013-07-11 07:40:32 -04:00
Joshua Eckroth
b8cc877c05
Bugfix and added empty test case to force loading/compiling of clj-ml.public-datasets.
2013-07-04 09:42:03 -04:00
Joshua Eckroth
a8ccca6d53
Support stemmers (use Porter/English/etc. rather than lowercase form).
...
Bumped to version 0.3.7.
2013-07-01 14:07:34 -04:00
Joshua Eckroth
4aa08595a1
More fixes for svm grid operation; added libsvm to project deps.
2013-06-21 01:00:55 -04:00
Joshua Eckroth
76fb70c602
Fixed broken test case.
2013-06-21 00:22:48 -04:00
Joshua Eckroth
e0d17445f0
Formatting.
2013-03-30 18:16:08 -04:00
Joshua Eckroth
05cf721ec5
Changed resample filter to resample-unsupervised, and added resample-supervised.
2013-03-22 12:35:24 -04:00
Joshua Eckroth
54eb9cf78b
Added resample filter.
2013-03-22 10:58:03 -04:00
Joshua Eckroth
30d2c1192f
Added an instance-loading test.
2013-03-22 10:57:49 -04:00
Joshua Eckroth
7004dd87cd
Added :reorder-attributes filter.
2013-03-22 10:09:38 -04:00
Joshua Eckroth
ed3bf1d71d
Added StringToWordVector filter.
2013-03-22 07:55:01 -04:00
Joshua Eckroth
436d6701cd
Bumped version, updated dependencies (new stable Weka), fixed a test.
2013-03-21 13:49:27 -04:00
Ben Mabey
dc12081b4a
attaches instance weight info as metadata when converting to a map
2012-03-26 12:20:45 -06:00
Ben Mabey
042de1895a
updates project to clojure 1.3, removes contrib and autodoc deps
2012-01-30 15:27:54 -07:00
Ben Mabey
c25d39521b
removes mongodb store.. I never used it and won't be supporting it
2012-01-05 13:29:45 -07:00
Ben Mabey
fa2d39ffe5
change the dataset's name when randomized (like the filters do)
2011-12-13 20:10:11 -07:00
Ben Mabey
8a01361512
adds take-dataset
2011-12-08 15:28:27 -07:00
Ben Mabey
d313f621d1
adds remove-range filter so split-dataset can take a number to split on
2011-12-08 15:13:34 -07:00
Ben Mabey
0c32c318af
adds split-dataset via new remove-percentage filter wrapper
2011-12-07 11:15:19 -07:00
Ben Mabey
84b1903b63
adds simple wrappers for Weka's attribute (feature) selection package
2011-10-04 13:10:01 -06:00
Ben Mabey
7357a720e9
makes representation of NaNs (no values) symmetric with creation- nil
2011-04-05 13:55:21 -06:00
Ben Mabey
a83e21d72b
allows for creation of instances (and datasets) with nil values
2011-04-05 13:34:23 -06:00
Ben Mabey
44df9744c1
adds dataset-as-vecs and dataset-as-lists
2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f
adds dataset-class-name
2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446
makes attribute-labels fn that returns set of labels and renames old one
2010-12-29 15:08:05 -07:00
Ben Mabey
228c58f74c
adds headers-only fn to extract the headers of a weka dataset
2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712
fixes dataset-format to return nominal labels in correct order
...
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ronny Bjarnason
9829c0eea4
fix typo decission->decision in tests
2010-12-14 15:16:39 -07:00
Ben Mabey
4d18af461e
not keywording the instance nominal values by default due to performance
...
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again. For the vast
majority of use cases string values should be fine. If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
81cda11ed2
DRYs up some fns and replaces custom code with enumeration-seq
2010-12-08 16:05:41 -07:00
Ben Mabey
652c54aaf7
standardizes how attributes can be asked for (by index or name)
2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1
adds tests for dataset label helper fns
2010-12-07 13:48:52 -07:00
Ben Mabey
138ee7f016
adds dataset-class-values
2010-12-06 17:36:36 -07:00
Ben Mabey
b6deb0421e
improves filters API by making filter fns that create and apply filters
2010-12-02 11:44:38 -07:00
Ben Mabey
3a393ebcf9
adds make-apply-filters
2010-12-01 16:22:19 -07:00
Ben Mabey
ca7281d214
allows attributes in filters to be specified by name in addition to index
2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4
allow for dataset-set-class to take the name of attribute
2010-12-01 11:37:27 -07:00
Ben Mabey
1f3900a628
replaces some loop/recurs with more conciser idiomatic clojure
2010-11-24 12:05:30 -07:00
Ben Mabey
ccc12c2d5c
adds dataset-replace-attribute!
2010-11-08 10:37:42 -07:00
Ben Mabey
2d274d2598
adds into-fast-vector util fn
2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5
adds helper fns in data and fixes some docstring misplacements
2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa
fixes is-dataset? to report correctly
2010-11-04 11:50:42 -06:00
Ben Mabey
249424ad81
adds ClojureBatchFilter
2010-11-03 11:10:41 -06:00