Introduction
    
    The first paragraph
    
[]           
[div]           
[div, div]          
[div, div, p]           
                            Introduction
[div, div]              
[div]               
[div, p]            
                        The first paragraph
[div]               
[div]               
[div, p]            
                        Another long paragraph
[div]               
[]

L'algorithme est alors très simple :


let res be the XML as a string ; 
read res and each time you encouter a tag: 
    if it is an opening one: 
        push it to the stack
    else if it is a closing one: 
        pop the stack.

remove any malformed/cutted tag in the end of res
for each tag in the stack, pop it, and write:
    res = res + closed tag

return res

Et `res` contiend le XML réparé. Finallement, voici le code en ruby que j'utilise. La variable `xml` contient le XML coupé.


# repair cutted XML code by closing the tags
# work even if the XML is cut into a tag.
# example:
#    transform '

toto

hello ]*$/m,'') depth-=1 depth.downto(0).each { |x| res<<= %{} } res end Je ne sais pas si ce code pourra vous être utile. Par contre le raisonnement pour y parvenir mérite d'être connu.