Make a complete DBpedia dump URL-unescaped

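# sed: rewrite %XX as \\xXX and backslash-escape quotes so xargs passes them through
# xargs echo -e: decode the \xXX sequences; gawk: put each triple back on its own line
sed 's/%\([0-9A-F][0-9A-F]\)/\\\\x\1/g;s/"/\\"/g;s/'"'"'/\\'"'"'/g' dbpedia_2013_07_18.nt \
  | xargs echo -e \
  | gawk '{print(gensub(/ \. </," .\n<","g",$0))}' > ./dbpedia_2013_07_18-CLEAN.nt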
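
The one-liner sends the whole file through xargs, which re-tokenises it and so depends on the quoting and on the line-splitting step afterwards. As a rough alternative, here is a minimal sketch that decodes each line on its own with plain awk. The function name unescape_percent is made up for this example and is not part of the original post. Like the one-liner, it decodes every %XX sequence, including ones such as %3E or %20 that may leave an IRI invalid, so check the output before loading it anywhere.

```shell
#!/bin/sh
# Hypothetical helper (an assumption, not from the original post):
# decode %XX escapes line by line, leaving everything else untouched.
# LC_ALL=C makes awk work on raw bytes, so %C3%A9 becomes the two
# UTF-8 bytes of "é" instead of two separately encoded characters.
unescape_percent() {
  LC_ALL=C awk '
    # Map one hex digit to its numeric value (0-15).
    function hexval(c) { return index("0123456789abcdef", tolower(c)) - 1 }
    {
      rest = $0
      out = ""
      # Consume the line left to right, replacing each %XX with its byte.
      while (match(rest, /%[0-9A-Fa-f][0-9A-Fa-f]/)) {
        byte = hexval(substr(rest, RSTART + 1, 1)) * 16 + hexval(substr(rest, RSTART + 2, 1))
        out = out substr(rest, 1, RSTART - 1) sprintf("%c", byte)
        rest = substr(rest, RSTART + 3)
      }
      print out rest
    }
  '
}
```

It reads from standard input and writes to standard output, so it could be used as: unescape_percent < dbpedia_2013_07_18.nt > dbpedia_2013_07_18-CLEAN.nt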

Posted by mgns to FB10 (2013-07-26 12:38)