Commit graph

43 commits

Author SHA1 Message Date
Ben Mabey
7357a720e9 makes representation of NaNs (no values) symmetric with creation- nil 2011-04-05 13:55:21 -06:00
Ben Mabey
a83e21d72b allows for creation of instances (and datasets) with nil values 2011-04-05 13:34:23 -06:00
Ben Mabey
44df9744c1 adds dataset-as-vecs and dataset-as-lists 2011-01-04 10:02:48 -07:00
Ben Mabey
8cb6620b6f adds dataset-class-name 2010-12-30 10:14:57 -07:00
Ben Mabey
7cd8e08446 makes attribute-labels fn that returns set of labels and renames old one 2010-12-29 15:08:05 -07:00
Ben Mabey
228c58f74c adds headers-only fn to extract the headers of a weka dataset 2010-12-22 14:23:44 -07:00
Ben Mabey
68cf83b712 fixes dataset-format to return nominal labels in correct order
Also made it more idiomatic and concise by using reduce...
2010-12-16 16:58:53 -07:00
Ronny Bjarnason
9829c0eea4 fix typo decission->decision in tests 2010-12-14 15:16:39 -07:00
Ben Mabey
4d18af461e not keywording the instance nominal values by default due to performance
On large datasets if you use dataset-as-maps too much time is taken up
in interning the nominal values time and time again.  For the vast
majority of use cases string values should be fine.  If we were in
clojure-land 100% then keywords would be the better option.. but we
aren't.
2010-12-08 16:10:45 -07:00
Ben Mabey
81cda11ed2 DRYs up some fns and replaces custom code with enumeration-seq 2010-12-08 16:05:41 -07:00
Ben Mabey
652c54aaf7 standardizes how attributes can be asked for (by index or name) 2010-12-07 14:14:20 -07:00
Ben Mabey
99048715f1 adds tests for dataset label helper fns 2010-12-07 13:48:52 -07:00
Ben Mabey
138ee7f016 adds dataset-class-values 2010-12-06 17:36:36 -07:00
Ben Mabey
b6deb0421e improves filters API by making filter fns that create and apply filters 2010-12-02 11:44:38 -07:00
Ben Mabey
3a393ebcf9 adds make-apply-filters 2010-12-01 16:22:19 -07:00
Ben Mabey
ca7281d214 allows attributes in filters to be specified by name in addition to index 2010-12-01 14:36:05 -07:00
Ben Mabey
78fc93c2a4 allow for dataset-set-class to take the name of attribute 2010-12-01 11:37:27 -07:00
Ben Mabey
1f3900a628 replaces some loop/recurs with more conciser idiomatic clojure 2010-11-24 12:05:30 -07:00
Ben Mabey
ccc12c2d5c adds dataset-replace-attribute! 2010-11-08 10:37:42 -07:00
Ben Mabey
2d274d2598 adds into-fast-vector util fn 2010-11-08 10:37:42 -07:00
Ben Mabey
eabb2acca5 adds helper fns in data and fixes some docstring misplacements 2010-11-08 10:37:42 -07:00
Ben Mabey
55ff30fcfa fixes is-dataset? to report correctly 2010-11-04 11:50:42 -06:00
Ben Mabey
249424ad81 adds ClojureBatchFilter 2010-11-03 11:10:41 -06:00
Ben Mabey
1412e9a86f allows ClojureStreamFilter to use a fn for updating the dataset-format 2010-11-02 17:42:03 -06:00
Ben Mabey
78a99eaa35 adds basic ClojureStreamFilter which accepts a clojure fn for processing 2010-11-02 16:51:15 -06:00
Ben Mabey
d87640e464 adds add-attribute filter 2010-11-01 17:13:35 -06:00
Ben Mabey
b09a59025f adds numeric to nominal filter 2010-11-01 14:05:11 -06:00
Ben Mabey
6ec5f1f2a0 similar options refactorings in classifiers that was done in filters 2010-11-01 11:14:07 -06:00
Ben Mabey
dfe20783af more filters DRYing 2010-10-29 17:58:07 -06:00
Ben Mabey
51fc383c15 general filters refactoring- mostly DRYing stuff up 2010-10-29 16:57:20 -06:00
Ben Mabey
497d65dffc adds RemoveUseless filter wrapper and does some test refactoring 2010-10-29 15:55:29 -06:00
Ben Mabey
790049feac bumps to clojure 1.2 2010-10-27 17:47:14 -06:00
Antonio Garrote
47e78f5fb4 Documentation for clusterers and small fixes in the clusterer options for :expectation-maximization clusterer 2010-03-16 09:22:49 +01:00
Antonio Garrote
ecd2e3579f Documentation and comments for filter.clj. Not finished yet. 2010-03-08 09:20:37 +01:00
Antonio Garrote
f91328f5e4 SMO Support Vector Machines and different kernel functions: polynomic, radia-basis and string kernels 2010-03-07 18:22:04 +01:00
Antonio Garrote
c1c7ba0ac5 Support for persisting data sets into MongoDB 2010-03-06 12:10:26 +01:00
Antonio Garrote
8facf1dfe2 EM clusterer added, remove-attributes filter added, improved syntax for filters and make-apply-filter form 2010-03-04 13:56:50 +01:00
Antonio Garrote
b13fd3b4b4 Updateable clusterers 2010-03-03 16:37:06 +01:00
Antonio Garrote
2670b037b2 Small improvements 2010-03-03 14:46:08 +01:00
Antonio Garrote
c37d36e5d3 Distance functions and distance function parameters for clusterers 2010-03-02 22:47:35 +01:00
Antonio Garrote
00ff6b1bc4 Visualization of more objects, clustering of collections and good clusterers evaluation 2010-03-02 21:24:11 +01:00
Antonio Garrote
eb60fe6d31 Some tests for clusterers 2010-02-28 20:51:17 +01:00
Antonio Garrote
360e507bd2 First commit 2010-02-28 13:14:17 +01:00