scratch/content/html/en/blog/Higher-order-function-in-zsh.md

216 lines
5 KiB
Markdown
Raw Normal View History

2011-09-27 15:11:40 +00:00
-----
isHidden: false
menupriority: 1
kind: article
2011-09-28 15:47:13 +00:00
created_at: 2011-09-28T15:15:23+02:00
2011-09-27 15:11:40 +00:00
title: Higher order function in zsh
author_name: Yann Esposito
author_uri: yannesposito.com
2011-10-26 08:49:00 +00:00
tags:
- zsh
- map
- foldr
- filter
- functional
- programming
- higher order functions
2011-09-27 15:11:40 +00:00
-----
blogimage("main.jpg","Title image")
2011-09-27 15:11:40 +00:00
begindiv(intro)
2012-07-09 22:30:25 +00:00
UPDATE: [Nicholas Sterling had discovered a way to implement anonymous functions](http://nicholassterling.wordpress.com/2012/03/30/a-zsh-map-function/)
2012-07-09 21:59:49 +00:00
and [Arash Rouhani has made a github repo which make install easier and added some tests](https://github.com/Tarrasch/zsh_functional).
2012-07-09 22:30:25 +00:00
Thanks to both of you!
2012-07-09 21:59:49 +00:00
With this last version you should use `map` if you use external function.
`mapl` to use lambda function. And `mapa` for arithmetic operations.
Example:
<code class="zsh">
2012-07-09 22:30:25 +00:00
$ filterl 'echo $1|grep a >/dev/null' ab cd ef ada
ab
ada
$ folda '$1+$2' {1..5}
15
$ folda '$1*$2' {1..20}
2432902008176640000
2012-07-10 06:38:07 +00:00
$ mapl 'echo X $1:t Y' ~/.zsh/functional/src/*
2012-07-09 22:30:25 +00:00
X each Y
X filter Y
X fold Y
X map Y
$ mapa '$1*2' {1..3}
2
4
6
2012-07-10 06:38:07 +00:00
$ mapl 'echo result $1' $(mapa '$1+5' $(mapa '$1*2' {1..3}))
2012-07-09 21:59:49 +00:00
result 7
result 9
result 11
2012-07-09 22:30:25 +00:00
2012-07-09 21:59:49 +00:00
</code>
%tldr some simple implementation of higher order function for zsh.
2011-09-27 15:11:40 +00:00
enddiv
2011-09-28 15:47:13 +00:00
Why is it important to have these functions?
Simply because, the more I programmed with zsh the more I tended to work using functional programming style.
2011-09-27 15:11:40 +00:00
2011-09-28 15:47:13 +00:00
The minimal to have better code are the functions `map`, `filter` and `fold`.
2011-09-27 15:11:40 +00:00
2011-09-28 15:47:13 +00:00
Let's compare.
First a program which convert all gif to png in many different directories of different projects.
Before ⇒
2011-09-27 15:11:40 +00:00
<code class="zsh">
# for each directory in projects dir
for toProject in /path/to/projects/*(/N); do
# toProject is /path/to/projects/foo
# project become foo (:t for tail)
project=${toProject:t}
for toResource in $toProject/resources/*.gif(.N); do
2011-09-28 15:47:13 +00:00
convert $toResource ${toResource:r}.png && \
2011-09-27 15:11:40 +00:00
\rm -f $toResource
done
done
</code>
2011-09-29 10:34:44 +00:00
- The `(/N)` means to select only directory and not to crash if there isn't any.
- The `(.N)` means to select only files and not to crash if there isn't any.
- The `:t` means tail; if `toto=/path/to/file.ext` then `${toto:t}=file.ext`.
2011-09-28 15:47:13 +00:00
After ⇒
2011-09-27 15:11:40 +00:00
<code class="zsh">
gif_to_png() { convert $1 ${1:r}.png && \rm -f $1 }
2011-09-28 15:47:13 +00:00
2011-09-27 15:11:40 +00:00
handle_resources() { map gif_to_png $1/resources/*.gif(.N) }
2011-09-28 15:47:13 +00:00
2011-09-27 15:11:40 +00:00
map handle_resources /path/to/projects/*(/N)
</code>
2011-09-28 15:47:13 +00:00
No more bloc!
It might be a little bit harder to read if you're not used to functional programming notation.
But it is more concise and robusts.
2011-09-27 15:11:40 +00:00
2011-09-28 15:47:13 +00:00
Another example with some tests.
2011-09-27 15:11:40 +00:00
2011-09-28 15:47:13 +00:00
Find all files in project not containing an `s` which their name contains their project name:
2011-09-27 15:11:40 +00:00
2011-09-28 15:45:28 +00:00
Before ⇒
2011-09-27 15:11:40 +00:00
<code class="zsh">
for toProject in Projects/*; do
project=$toProject:t
if print -- project | grep -v s >/dev/null
then
print $project
for toResource in $toProject/*(.N); do
if print -- ${toResource:t} | grep $project >/dev/null; then
print -- "X $toResource"
fi
done
fi
done
</code>
2011-09-28 15:45:28 +00:00
After ⇒
2011-09-27 15:11:40 +00:00
<code class="zsh">
contain_no_s() { print $1 | grep -v s }
2011-09-28 15:45:28 +00:00
2011-09-27 15:11:40 +00:00
function verify_file_name {
local project=$1:t
contains_project_name() { print $1:t | grep $project }
map "print -- X" $(filter contains_project_name $1/*(.N))
}
2011-09-28 15:45:28 +00:00
2011-09-29 10:34:44 +00:00
map verify_file_name $( filter contain_no_s Projects/* )
2011-09-27 15:11:40 +00:00
</code>
Also, the first verstion is a bit easier to read.
But the second one is clearly far superior in architecture.
2011-09-28 15:47:13 +00:00
I don't want to argue why here.
Just believe me that the functional programming approach is superior.
Actually I lack the lambda operator.
If someone has an idea on how to create anonymous functions, just tell me, thanks.
2012-07-09 22:30:25 +00:00
Here is the (first version[^1]) source code:
[^1]: As stated in the intro, if you want to install it, just go [there](https://github.com/Tarrasch/zsh_functional).
2011-09-28 15:45:28 +00:00
<code class="zsh" file="functional.sh">
#!/usr/bin/env zsh
# Provide higer-order functions
# usage:
#
# $ foo(){print "x: $1"}
# $ map foo a b c d
# x: a
# x: b
# x: c
# x: d
function map {
local func_name=$1
shift
for elem in $@; print -- $(eval $func_name $elem)
}
# $ bar() { print $(($1 + $2)) }
# $ fold bar 0 1 2 3 4 5
# 15
# -- but also
# $ fold bar 0 $( seq 1 100 )
function fold {
if (($#<2)) {
print -- "ERROR fold use at least 2 arguments" >&2
return 1
}
if (($#<3)) {
print -- $2
return 0
} else {
local acc
local right
local func_name=$1
local init_value=$2
local first_value=$3
shift 3
right=$( fold $func_name $init_value $@ )
acc=$( eval "$func_name $first_value $right" )
print -- $acc
return 0
}
}
# usage:
#
# $ baz() { print $1 | grep baz }
# $ filter baz titi bazaar biz
# bazaar
function filter {
local predicate=$1
local result
typeset -a result
shift
for elem in $@; do
if eval $predicate $elem >/dev/null; then
result=( $result $elem )
fi
done
print $result
}
</code>