2010-08-23 12:26:48 +00:00
<?xml version="1.0" encoding="utf-8"?>
< !DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
< html xmlns = "http://www.w3.org/1999/xhtml" lang = "fr" xml:lang = "fr" >
< head >
< meta http-equiv = "Content-Type" content = "text/html; charset=UTF-8" / >
< meta name = "keywords" content = "awk, shell, script" >
< link rel = "shortcut icon" type = "image/x-icon" href = "/Scratch/img/favicon.ico" / >
< link rel = "stylesheet" type = "text/css" href = "/Scratch/assets/css/main.css" / >
< link rel = "stylesheet" type = "text/css" href = "/Scratch/css/twilight.css" / >
< link rel = "stylesheet" type = "text/css" href = "/Scratch/css/idc.css" / >
< link rel = "alternate" type = "application/rss+xml" title = "RSS" href = "http://feeds.feedburner.com/yannespositocomfr" / >
< link rel = "alternate" lang = "fr" xml:lang = "fr" title = "découper un fichier par mots clés" type = "text/html" hreflang = "fr" href = "/Scratch/fr/blog/2010-02-18-split-a-file-by-keyword/" / >
< link rel = "alternate" lang = "en" xml:lang = "en" title = "split a file by keyword" type = "text/html" hreflang = "en" href = "/Scratch/en/blog/2010-02-18-split-a-file-by-keyword/" / >
< script type = "text/javascript" src = "/Scratch/js/jquery-1.3.1.min.js" > < / script >
< script type = "text/javascript" src = "/Scratch/js/jquery.cookie.js" > < / script >
< script type = "text/javascript" src = "/Scratch/js/index.js" > < / script >
< title > découper un fichier par mots clés< / title >
< / head >
< body lang = "fr" >
< script type = "text/javascript" > / / < ! [ C D A T A [
document.write('< div id = "blackpage" > < img src = "/Scratch/img/loading.gif" alt = "Chargement en cours..." / > < / div > ');
// ]]>
< / script >
< div id = "content" >
2010-09-27 18:49:15 +00:00
< div id = "choix" >
< div class = "return" > < a href = "#entete" > ↓ Menu ↓ < / a > < / div >
< div id = "choixlang" >
< a href = "/Scratch/en/blog/2010-02-18-split-a-file-by-keyword/" onclick = "setLanguage('en')" > in English< / a >
< / div >
< / div >
< img src = "/Scratch/img/presentation.png" alt = "Presentation drawing" / >
2010-08-23 12:26:48 +00:00
< div id = "titre" >
< h1 >
découper un fichier par mots clés
< / h1 >
< / div >
< div class = "flush" > < / div >
< div class = "flush" > < / div >
< div id = "afterheader" >
< div class = "corps" >
< p > Assez bizarrement, je n’ ai trouvé aucun outil UNIX pour découper un fichier par mot clé.
Alors j’ en ai fait un en < code > awk< / code > . Je le met ici principalement pour moi, mais ça peut toujours servir à quelqu’ un d’ autre.
Le code suivant découpe un fichier pour chacune de ses ligne contenant le mot < code > UTC< / code > .< / p >
< div > < pre class = "twilight" >
< span class = "Comment" > < span class = "Comment" > #< / span > !/usr/bin/env awk< / span >
< span class = "Entity" > BEGIN< / span > {i=0;}
< span class = "StringRegexp" > < span class = "StringRegexp" > /< / span > UTC< span class = "StringRegexp" > /< / span > < / span > {
i+=1;
FIC=< span class = "SupportFunction" > sprintf< / span > (< span class = "String" > < span class = "String" > " < / span > fic.%03d< span class = "String" > " < / span > < / span > ,i);
}
{< span class = "SupportFunction" > print< / span > < span class = "Variable" > < span class = "Variable" > $< / span > 0< / span > > > FIC}
< / pre > < / div >
< p > En réalité, j’ avais besoin de cet outils pour avoir un fichier par jour. Chaque ligne contenant UTC ayant le format suivant :< / p >
< pre class = "twilight" >
Mon Dec 7 10:32:30 UTC 2009
< / pre >
< p > J’ en suis finallement arrivé au code suivant :< / p >
< div > < pre class = "twilight" >
< span class = "Comment" > < span class = "Comment" > #< / span > !/usr/bin/env awk< / span >
< span class = "Entity" > BEGIN< / span > {i=0;}
< span class = "StringRegexp" > < span class = "StringRegexp" > /< / span > UTC< span class = "StringRegexp" > /< / span > < / span > {
date=< span class = "Variable" > < span class = "Variable" > $< / span > 1< / span > < span class = "Variable" > < span class = "Variable" > $< / span > 2< / span > < span class = "Variable" > < span class = "Variable" > $< / span > 3< / span > ;
< span class = "Keyword" > if< / span > ( date != olddate ) {
olddate=date;
i+=1;
FIC=< span class = "SupportFunction" > sprintf< / span > (< span class = "String" > < span class = "String" > " < / span > fic.%03d< span class = "String" > " < / span > < / span > ,i);
}
}
{< span class = "SupportFunction" > print< / span > < span class = "Variable" > < span class = "Variable" > $< / span > 0< / span > > > FIC}
< / pre > < / div >
< / div >
< div id = "choixrss" >
< a id = "rss" href = "http://feeds.feedburner.com/yannespositocomfr" >
s'abonner
< / a >
< / div >
< script type = "text/javascript" >
$(document).ready(function(){
$('#comment').hide();
$('#clickcomment').click(showComments);
});
function showComments() {
$('#comment').show();
$('#clickcomment').fadeOut();
}
document.write('< div id = "clickcomment" > Commentaires< / div > ');
< / script >
< div class = "flush" > < / div >
< div class = "corps" id = "comment" >
< h2 class = "first" > commentaires< / h2 >
< noscript >
< / noscript >
< script type = "text/javascript" >
var idcomments_acct = 'a307f0044511ff1b5cfca573fc0a52e7';
var idcomments_post_id = '/Scratch/fr/blog/2010-02-18-split-a-file-by-keyword/';
var idcomments_post_url = 'http://yannesposito.com/Scratch/fr/blog/2010-02-18-split-a-file-by-keyword/';
< / script >
< span id = "IDCommentsPostTitle" style = "display:none" > < / span >
< script type = 'text/javascript' src = '/Scratch/js/genericCommentWrapperV2.js' > < / script >
< / div >
< div id = "entete" class = "corps_spaced" >
< div id = "liens" >
2010-09-17 15:15:29 +00:00
< ul > < li > < a href = "/Scratch/fr/" > Accueil< / a > < / li >
2010-08-23 12:26:48 +00:00
< li > < a href = "/Scratch/fr/blog/" > Blog< / a > < / li >
2010-09-30 13:01:14 +00:00
< li > < a href = "/Scratch/fr/softwares/" > Softwares< / a > < / li >
2010-09-28 01:00:51 +00:00
< li > < a href = "/Scratch/fr/about/" > À propos< / a > < / li > < / ul >
2010-08-23 12:26:48 +00:00
< / div >
< div class = "flush" > < / div >
< hr / >
< div id = "next_before_articles" >
< div id = "previous_articles" >
articles précédents
< div class = "previous_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-02-16-All-but-something-regexp--2-/" > < span class = "nicer" > «< / span > Tout sauf quelquechose en expression régulière.< / a >
2010-08-23 12:26:48 +00:00
< / div >
< div class = "previous_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-02-15-All-but-something-regexp/" > < span class = "nicer" > «< / span > Expression régulière pour tout sauf quelquechose< / a >
2010-08-23 12:26:48 +00:00
< / div >
< div class = "previous_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-01-12-antialias-font-in-Firefox-under-Ubuntu/" > < span class = "nicer" > «< / span > Fontes adoucies sous Ubuntu Firefox< / a >
2010-08-23 12:26:48 +00:00
< / div >
< / div >
< div id = "next_articles" >
articles suivants
< div class = "next_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-02-23-When-regexp-is-not-the-best-solution/" > Quand se passer des expressions régulières ? < span class = "nicer" > »< / span > < / a >
2010-08-23 12:26:48 +00:00
< / div >
< div class = "next_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-03-22-Git-Tips/" > Astuces Git < span class = "nicer" > »< / span > < / a >
2010-08-23 12:26:48 +00:00
< / div >
< div class = "next_article" >
2010-09-28 15:10:12 +00:00
< a href = "/Scratch/fr/blog/2010-03-23-Encapsulate-git/" > Encapsuler git < span class = "nicer" > »< / span > < / a >
2010-08-23 12:26:48 +00:00
< / div >
< / div >
< div class = "flush" > < / div >
< / div >
< / div >
< div id = "bottom" >
< div >
< a rel = "license" href = "http://creativecommons.org/licenses/by-sa/3.0/deed.fr" > Droits de reproduction ©, Yann Esposito< / a >
< / div >
< div id = "lastmod" >
2010-08-31 13:06:43 +00:00
Écrit le : 18/02/2010
2010-09-02 09:51:46 +00:00
modifié le : 09/05/2010
2010-08-23 12:26:48 +00:00
< / div >
< div >
Site entièrement réalisé avec
< a href = "http://www.vim.org" > Vim< / a >
et
< a href = "http://nanoc.stoneship.org" > nanoc< / a >
< / div >
< div >
< a href = "/Scratch/fr/validation/" > Validation< / a >
< a href = "http://validator.w3.org/check?uri=referer" > [xhtml] < / a >
.
< a href = "http://jigsaw.w3.org/css-validator/check/referer?profile=css3" > [css] < / a >
.
< a href = "http://validator.w3.org/feed/check.cgi?url=http%3A//yannesposito.com/Scratch/fr/blog/feed/feed.xml" > [rss]< / a >
< / div >
< / div >
< div class = "clear" > < / div >
< / div >
< / body >
< / html >