[Rpm-metadata] createrepo: initial comments and a UTF-8 patch

seth vidal skvidal at phy.duke.edu
Mon Jul 26 06:25:50 UTC 2004


> I tested with some self-created nasty ones, as well as actual Conectiva
> packages.  Could not find a PLD package with "bad" chars in any of the
> fields in a quick search (but added ISO-8859-2 to the list of encodings
> to guess anyway, thinking about PLD :)
> 
> Note that if the conversion is unsuccessful, it falls back to the old
> "?" stuff.  And as said, the current code fails with stuff that is
> already in UTF-8.  Try any recent package by yours truly...

Cool. I'll take a look at some of the packages I used to test with and
see if they explode or not :) the most important thing is that xmllint
--noout doesn't have problems when we're done.

Thanks.

> Well, in my tests I could not find a case where libxml would not
> automatically do the string cleanup, ie. where createrepo would produce
> invalid XML without xmlCleanString() calls in place.  If you have a case
> where the auto-escaping does not happen, let me know.

When I was working on it originally libxml2-python wouldn't play nicely.
If it does now, then great, but I'm afraid of it blowing up on older
versions. I'll see if anything gets odd on older systems.

Thanks.



> Where?  I see RPMTAG_CHANGELOG{TIME,NAME,TEXT}, no author.

You're right. Sorry about that - but I always took name == person who
wrote the changelog so - author.


> 
> > I think just dumping the output as it occurs in the
> > rpm and letting the client program mangle it would be best.
> 
> Agreed, but isn't it that way already?  Changelog is stored in three
> arrays in the rpm header, not one big formatted lump.

And now each entry can be extracted from a list of tuples.

> Accidentally, .htaccess, something?  Whatever, I tend to think that
> createrepo should not choke on it.

So you'd rather it blew away your addition to that dir rather than
warning you about it?

-sv





More information about the Rpm-metadata mailing list