IRC log for #koha, 2006-04-06

All times shown according to UTC.

Time S Nick Message
16:46 slef http://www.jet2.com/News.aspx?id=65 "Jet2.com [...] calls for lazy frogs to get back to work!"
16:46 slef Jet2 - I can't believe it's an airline.
16:46 owen Charming
16:47 owen chris, you around?
22:51 russ cos the other channel is too busy
22:51 chris now in koha?
02:25 hdl hi pierrick
02:26 pierrick hi hdl & paul
02:40 paul plein de français ce matin encore...
02:40 paul osmoze ?
02:41 paul est-ce que tu as eu le temps de tester l'install windows ?
02:41 osmoze bonjour
02:41 paul bonjour d'abord, bonne remarque !
02:41 osmoze j avais commencer mais pas terminé...Tiens, tu me choppes au reveille, j ai rien prevu ce matin, j attaque :)
02:42 osmoze (en fait, j ai fais l install avec phpmyadmin et au final pb de perl :()
03:14 paul pierrick hésite beaucoup ce matin encore...
03:14 paul a t'il l'utilité d'un _ ou pas ?
03:14 paul bonne question...
03:15 pierrick désolé, problemes de connexion internet chez INEO
03:15 paul je sais je sais. Ca m'amuse juste ;-)
03:16 paul (et ton login apparait chez moi en vert pale, du coup le _ est difficile à voir. donc faut que je fasse attention si je veux te "bipper"
03:16 pierrick tu utilises quoi comme client IRC ?
03:16 pierrick parce que irssi ou Xchat font l'autocomplétion des nicks présent sur le chanel.
03:16 pierrick s/chanel/channel/
03:17 paul pierrick: ah, c'est vrai, KSirc aussi
03:17 paul mais j'y pense jamais !
03:17 pierrick paul, tu as combien de stopwords dans ta base de dev HEAD?
03:22 paul ca dépend de la base.
03:22 paul plusieurs.
03:22 paul (dans la plupart des cas, j'ai ce qui est mis par défaut à l'install.
03:22 paul ca doit faire une 50aine de mots
03:23 pierrick OK, je suis en train de reprendre la page de gestion des stopwords
03:24 pierrick en partant du principe que les stopwords sont courts et qu'on peut faire peut d'opération dessus. Tu me diras ce que tu en penses. (initialement c'était juste pour ajouter la barre de pagination)
03:24 paul OK.
03:24 paul quoi que : STOP
03:24 paul ne perd pas de temps là dessus. Les mots vides seront inutiles dans Koha 3.0 probablement
03:24 paul vu que la recherche srea gérée par zebra !
03:24 paul qui utilise d'autres mécanismes.
03:24 paul quoi que finalement :
03:25 paul les mots vides sont à mettre dans un fichier à part. Peut être que l'on génèrera ce fichier à partir de la table stopwords.
03:25 paul bref, je ne suis pas trop sûr.
03:25 paul mais ne passe pas 1 journée là dessus ;-)
03:28 pierrick non, pas un journée, je te rassure, de toute façon, je vais chez le fameux client dans une heure
03:28 paul "chez le fameux client" ?
03:29 pierrick un gros client d'INEO
04:40 osmoze hdl>  ?
04:41 osmoze paul>  ?
04:42 paul ye
04:42 paul yep
04:42 osmoze non c edst bon paul merci :)
04:43 osmoze (enfin désolé ^^)
07:13 osmoze hdl ?
07:17 hdl salut
07:40 kados morning all
07:40 kados paul: do you have a quick second?
07:41 kados paul: I'm importing some records and I'm afraid they are truncating
07:41 kados paul: with bulkmarcimport
07:41 kados I get the following error in the log several hundred times:
07:41 kados no mapping found at position 8 in Price : ~\ 36.00;    Inv.# CSI102254;    Date  30/11/98; Supplier : CroftHouse, Surrey, UK;  Recd 11/12/98;  Contents : 1. Bringing your technology to market; 2. So do you really have something of value? 3. The first steps towards commercialisation of your technology; 4. The difficult problem of valuation of intellectual property; 5. Developing your ideas; 6. The licensing option; 7. Forming your own company; 8. Financing the busines
07:45 paul hi kados.
07:45 paul "no mapping found at position 8".
07:45 paul I never saw this before.
07:45 paul could it be a MARC::charset problem ?
07:45 paul ("mapping" is sometihng MARC::charset does)
07:46 paul (thus my idea)
07:49 kados yep, could be
07:49 kados just wondering if you had seen it before
07:49 paul so, the answer is no ;-)
07:49 kados :-)
07:50 paul was it iso2790 ?
07:51 kados yes
07:51 kados and claims to be MARC-8
07:51 kados encoded
07:52 kados the record imported fine
07:52 kados but the data in 856 was discarded by bulkmarcimport
07:52 kados http://wipoopac.liblime.com/cg[…]tail.pl?bib=24436
07:52 kados 856 contained:
07:52 kados 856    _aPrice : <9C> 36.00;    Inv.# CSI102254;    Date  30/11/98; Supplier : C
07:52 kados roftHouse, Surrey, UK;  Recd 11/12/98;  Contents : 1. Bringing your technology t
07:52 kados o market; 2. So do you really have something of value? 3. The first steps toward
07:52 kados s commercialisation of your technology; 4. The difficult problem of valuation of
07:52 kados intellectual property; 5. Developing your ideas; 6. The licensing option; 7. Fo
07:52 kados rming your own company; 8. Financing the business start up; 9. Making your techn
07:53 kados ology a commercial success;
07:53 kados which I can view fine in dumpmarc.pl
07:54 paul you write a specific css for every customer ?
07:56 paul (/me happy to see that it's wipo ;-) )
07:56 kados yes, most customers have custom css for at least color, some for layout as well
07:56 paul they want it, or you propose ?
07:57 kados it's a required item
07:57 paul (because if I don't propose, they don't ask !)
07:57 kados if they complain, we drop it :-)
07:57 kados because we have found everyone wants their Koha tweaked in some way
07:57 kados and we can't afford to do it for free :-)
07:59 paul how many do you charge for this feature ?
08:16 kados paul: quick questions when you're off phone
08:16 kados paul: what do you think about a syspref for 'encoding'
08:17 kados paul: that allows the librarian to specify the encoding scheme to be used in intranet and opac templates?
08:40 kados paul: also, we must discuss encoding probs when you are available
08:40 kados paul: in rel22 and head
08:40 kados paul: well, rel24 now :-)
08:41 kados paul: it took me about 5 minutes to add a TemplateEncoding variable to Auth.pm, and the templates ... I will add it to CVS and to default templates if you ask me too
08:42 kados paul: i think utf-8 catalogs will appreciate it
08:42 kados paul: as they won't need to edit the template directly
08:42 paul IPT thanks a LOT the guy who wrote CloneTag & CloneSubfield !!!
08:42 kados great! :-)
08:43 kados do they like 'subfield reordering' too?
08:45 paul yep
08:45 paul (still on phone)
09:27 paul kados : i'm back
09:27 paul explain what you're speaking of with Encoding ;-)
09:31 kados encoding:
09:31 kados right now, in 2.4, utf-8 is default
09:31 kados in templates
09:31 kados but it's a real problem if a library has some other encoding
09:31 kados they must edit all template headers to change the value
09:31 kados utf-8 => 8859-1, etc.
09:32 kados so why not just have a template syspref called 'TemplateEncoding'
09:32 kados then, in templates, put:
09:32 kados <!-- TMPL_IF NAME="TemplateEncoding" -->
09:32 kados <meta http-equiv="Content-Type" content="text/html; charset=<!-- TMPL_VAR NAME='TemplateEncoding' -->" />
09:32 kados <!-- TMPL_ELSE -->
09:32 kados <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
09:32 kados <!-- /TMPL_IF -->
09:32 kados in fact, I've already done this
09:33 kados and it works perfectly
09:33 kados if you want I will commit to 2.4 immediately
09:33 kados it only takes about 5 mintes to add the template code
09:33 kados for default
09:33 kados paul: ?
09:33 paul (on phone again !)
09:33 kados hehe
09:38 paul back
09:38 paul utf8 is NOT the default for me in 2.4
09:38 paul (I mean, at least in default templates)
09:38 paul I've seen that you have it in npl, but it's not in default or css
09:38 paul but ok for this patch.
09:39 kados ok, thanks
09:39 paul but defaulting it to iso8859-1 !
09:39 kados absolutely
09:39 paul as it's what existing libraries have.
09:39 kados another problem
09:39 kados I've got
09:39 kados well, wait
09:39 kados we still have some problems in 2.4 related to encoding
09:39 kados for instance, with the marc editor
09:40 kados but before we discuss that
09:40 kados observe this record:
09:40 kados http://wipoopac.liblime.com/cg[…]tail.pl?bib=17996
09:40 kados in 245 $a (title)
09:40 paul a suggestion (for wipoopac) : the footer would be better with 2 links I think : 1 for koha.org & 1 for liblime.com
09:40 kados ahh, good idea
09:41 kados in 245 $a (title), there are several special characters
09:41 kados but I believe they are ruined on import
09:41 kados or else 8859 is not the proper encoding to display them
09:41 kados (as wipo is set to 8859)
09:41 paul mmm... I bet it's char_decode that is wrong on this specific german char.
09:42 paul it should be über or Über
09:42 kados is char_decode for MARC-8 data?
09:42 kados in fact, there are many characters
09:42 kados that are incorrect
09:42 paul yep.
09:42 kados I'm not even sure how to begin troubleshooting
09:42 paul this char_decode causes a lot of problems.
09:43 paul you could try to play with Encode module !
09:43 paul I think it's the best way to do it.
09:43 kados one idea I had was to encode the entire collection as utf-8
09:43 kados on import
09:43 kados this would solve problem #2
09:44 kados which you will want to know about
09:44 kados as it relates to the new MARC editor
09:44 kados the character encoding of the new MARC editor is utf-8 by default
09:45 kados it may attempt to convert any other char encoding to utf-8 when editing a record
09:45 kados and then save the record as utf-8
09:45 paul you mean even for default templates ?
09:45 kados it's in the MARChtml2xml routine in Biblio.pm
09:46 kados so yes, even for default I'm afraid
09:46 kados I posted a warning in cvs notes when committing this code
09:46 kados but didn't hear anything :-)
09:47 kados the problem is that xml must be in utf-8
09:47 kados perhaps the solution is to re-encode in the original encoding after using xml
09:47 kados slef++
09:48 kados slef: we're drowning in encoding problems
09:48 slef kados: I thought all flour in the US was drugged already?
09:48 owen slef: that's why we're all so happy over here
09:48 kados slef: I eat organic flour
09:48 kados slef: no drugs :-)
09:48 slef kados: you sure? Even some of our organic flour is drugged already.
09:49 kados slef: if you want to bail us out of our encoding probs, please do
09:49 owen Paul, I have a question for you about C4::Context->userenv->{branch}
09:49 paul owen : ask hdl, he wrote userenv.
09:50 owen Okay, for whomever: Is there a simple way to pull the branch name at the same time, rather than just the branch code?
09:50 kados isn't it just 'branchname'?
09:51 kados if not, should be fairly painless to add it to userenv
09:51 hdl owen yes.
09:52 hdl I read.
09:52 hdl I do it and commit.
09:53 owen Thanks, that's going to be a REALLY BIG help for our librarians
09:54 shedges ...and I won't be there to get any credit for the improvement :-(
09:55 owen No, but we'll put up a commemorative plaque for hdl ;)
09:57 shedges in French?
09:59 kados heh
10:06 paul kados : you're right, we have a major problem in MARC editor on default !!!
10:07 paul (I did not saw it on add, just on modif !)
10:07 hdl commited in 2.2 branch
10:07 kados right
10:07 kados paul: check addbiblio.pl for an 'encode' line
10:08 kados paul: if there, try removing it
10:08 kados paul: and see if modif works
10:08 paul there is one, but it's commented
10:08 kados ok
10:08 paul (just below "sub create_input")
10:08 kados right
10:08 kados so I'm quite puzzled about how to proceed
10:09 kados I can reintroduce char_encode, or I can try to build a better method
10:09 kados slef: could use some of your input on this as well
10:11 kados paul: so we could :
10:11 kados 1. check encoding of all records going into editor
10:12 kados 2. preserve encoding before running MARChtml2xml
10:12 kados 3. convert back to old encoding after MARChtml2xml
10:12 kados because MARChtml2xml (which replaces the very buggy MARChtml2marc) requires utf-8
10:13 kados I dont' think any xml parsers recognize marc-8 or some of the other strange encodings in library records
10:14 kados paul: do you have any ideas?
10:14 paul ... thinking...
10:15 paul hdl : about your previous commit : left join is faster than where.
10:16 paul so you had better do a left join to get branchname, as it will be called on every page !
10:16 owen also:
10:16 owen Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 409.
10:16 owen Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 414.
10:16 owen Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 418.
10:16 owen Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 429.
10:17 paul hdl : you just loose your commemorative plaque !
10:17 kados hehe
10:17 paul kados : 1/2/3 : do you mean we will have to do 1 AND 2 AND 3 everytime ?
10:17 slef kados: I guess we start with utf-8 in the xml bit and then try to expand it outwards to cover all koha internals.
10:17 hdl hey I'm not dead yet !
10:18 slef by the way folks, msg me if I seem to be away and I'll see it later
10:18 paul I will really become mad with utf8 & Perl !
10:18 kados paul: me too!
10:19 kados paul: and with xml parsers for not supporting other encodings
10:19 kados paul: and for MARC::Record and MARC::File::XML and MARC::Charset :-)
10:19 kados paul: we have several cases:
10:19 kados 1. record coming from Z3950
10:19 kados 2. record coming from resevoir
10:20 kados 3. record coming from Koha (imported with bulkimport)
10:20 kados 4. record coming from KOha (previously edited/added in Koha)
10:20 paul for Instance, I just want to solve : editing a record coming from Koha !
10:20 kados right
10:21 kados so everything is edited as 8859 with your clients?
10:21 kados since your templates are 8859?
10:21 paul yep
10:21 kados so what if a record is not 8859, does it currently display incorrectly when editing?
10:21 kados (in 2.2.5)
10:22 kados (if the record is outside of ascii range?)
10:26 kados I read that as well, it scares me :-)
10:26 paul scares ?
10:27 kados I am scared to guess the encoding :-)
10:27 kados as it would be hard to confirm it was correct with 50K records
10:28 paul what do you did for NPL ?
10:28 kados for encoding?
10:28 paul as they have some biblios with diacritics iirc.
10:29 paul are they in UTF8 ?
10:29 kados no, they still use 8859
10:29 paul so the new MARC editor don't work for them ?
10:29 kados but I suspect it's a problem Koha has had for some time
10:29 kados no, I must fix this problem
10:30 kados but in fact, Koha already was broken for them with regards to encodings
10:30 kados I suspect that MARC-8 was never supported in Koha
10:30 kados because it probably does not exist in UNICODE :-)
10:31 kados so if we assume we only must deal with two encodings in 2.4
10:32 kados our solution could be quite simple
10:32 kados check TemplateEncoding var
10:32 kados if 8859-1
10:33 paul (ok, let's say we have only 2 encodings, that's OK for me !)
10:33 kados convert to utf-8 in MARChtml2xml at the beginning
10:33 kados then, after MARChtml2xml, convert back to 8859-1
10:33 kados so the user will never know it was in utf-8
10:33 paul but... when I reach addbiblio, I immediatly get a problem. html2xml has not be called !
10:33 paul Biblio.pm just call MARCgetbiblio, that is xml independant, right ?
10:34 kados is that because you are editing one you already edited?
10:34 paul yep
10:34 kados if you edit or add one with addbiblio, it is utf-8
10:34 kados MARCgetbiblio is xml independant
10:34 kados only MARChtml2xml uses XML
10:35 paul ??? I don't understand, sorry
10:35 kados ok ... here's the history
10:35 kados MARChtml2marc was very buggy
10:35 kados it lost data for instance
10:35 kados and was very difficult to read
10:36 kados so i wrote a new method that was easier to read
10:36 kados it converts the html form submission into MARCxml
10:36 paul (let me know when the history if finished)
10:36 kados hehe
10:36 kados finished
10:37 paul so, go to
10:37 paul http://i8.bureau.paulpoulain.c[…]kcode=Default&op=
10:37 paul (test/test should work)
10:37 kados ok, I'm there
10:37 paul 020 tag :
10:37 paul should be Numéro
10:37 paul 200$a should be
10:38 paul L'autorité
10:38 paul so :
10:38 kados ahh ... this is not in the _data_ but in the descriptions!
10:38 paul diacritics are gone for TAG labels as well as datas !
10:38 paul If you add a new biblio, you'll get empty data, of cours, but VALID labels !
10:39 paul who transformed descriptions ???
10:39 kados not me
10:39 kados hmmm ...
10:39 kados was char_encode used to dynamically transform descriptions in the editor?
10:40 kados in previous version of Koha?
10:40 paul hint : on the previous link, force "encoding utf8" in firefox
10:40 paul you'll get correct labels & wrong (very strange !) datas
10:41 kados are you sure you did not convert mysql to have utf-8 in the tables on this test box?
10:41 paul yep.
10:41 paul (& it works perfectly when you look at a biblio or create a new one)
10:42 kados wow, that is strange
10:42 kados but labels are stored in the same place, right?
10:42 kados each record does not have it's own copy of a label
10:42 osmoze bye all
10:43 hdl owen recommited.
10:43 paul and get :
10:43 paul [Wed Apr 05 17:43:17 2006] [error] [client 213.41.184.186] not well-formed (invalid token) at line 11, column 19, byte 480 at /usr/lib/perl5/vendor_perl/5.8.6/i386-linux/XML/Parser.pm line 187, referer: http://i8.bureau.paulpoulain.c[…]mple/addbiblio.pl
10:44 paul :-(((
10:46 paul which one(s) ?
10:46 kados and not have a marc editor incapable of producing valid marc
10:46 paul html2marc or html2xml ?
10:46 kados the encoding probs are easier to fix IMO
10:47 kados esp if we agree to only deal with two encodings
10:47 paul 1st question then :
10:47 paul why has the previous page wrong labels ?
10:47 paul that's really strange !
10:47 kados I'm checking
10:47 kados agreed, quite strange
10:48 kados the page must be build differently for edits than for new records
10:51 paul something strange is probably done on line 527+/-
10:51 paul OK, you were wrong ;-)
10:52 paul line 423, if we are in a mod = read biblio from MARCgetbibio
10:52 paul and line 527/532 => read it again, but FROM XML !
10:52 paul just comment line 532, and everything is better !
10:53 paul (although i'm not sure there is no side effect !)
10:54 kados ahh
10:54 paul internal server error when saving !
10:55 paul not well-formed (invalid token) at line 15, column 34, byte 635...
10:55 kados in fact, I find addbiblio quite confusing
10:56 kados paul: I'll install unimarc koha and troubleshoot this
10:59 paul kados : did you tried to search /pancakes on team42 interface ?
10:59 paul really LOL !
10:59 kados paul: :-)
10:59 kados paul: did ToinS discover that? :-)
10:59 paul yes
10:59 ToinS yes !
11:00 kados it's quite funny :-)
11:00 kados It think there are some others as well
11:00 paul yep. it's the kind of goodies we could add to Koha (like pictures of the core team)
11:00 kados yea :-)
11:01 paul OK, 6PM, I hope i'll have a fix for this problem in koha-cvs tomorrow morning ;-)
11:01 paul otherwise, i'll investigate too.
11:01 kados ok, I"ll do my best
11:01 paul don't forget to mail me if you find something interesting even if you don't catch the solution
11:01 kados ok
11:01 ToinS Bye all ...
11:01 paul do youwant a complete french DB ?
11:02 kados paul: yes please :-)
11:16 hdl owen : have you tried my fix.
11:16 hdl ?
11:16 owen Yes, it works great. Thank you!

| Channels | #koha index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary