/srv/irclogs.ubuntu.com/2023/10/14/#ubuntu-uk.txt

=== AMD is now known as Nokaji
[13:56] <daftykins> hmm, fascinating situation on a client system once again - i might well have shared how i had worked to batch convert scanned documents at one place to reduce the size on disk by a huge margin
[13:57] <daftykins> this time, i found a practice where half the data size of an application was taken up with 6,600 KB .RTF files representing just tiny word processed documents
[13:57] <daftykins> seems to be something nutty about a typical RTF wherein it's all raw ASCII with no compression - and tonnes of tags for formatting, making up the overall size
[13:58] <daftykins> i just fed January 2012's 189 .RTF files into LibreOffice batch conversion... input size: 1.25 GB - output size: 92 MB ; 7.17 % of original
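A minimal sketch of the kind of batch conversion described above, assuming LibreOffice's `soffice` binary is on the PATH and that its `--headless --convert-to docx --outdir` options are used. The helper names (`build_convert_command`, `convert_keep_name`) and the `runner` parameter are hypothetical, and this is not necessarily how the conversion in the log was actually run. Each converted file is renamed back to its original name, so it keeps the .RTF extension while holding .docx content.

```python
import subprocess
from pathlib import Path


def build_convert_command(src, out_dir, soffice="soffice"):
    """Build the LibreOffice command line that converts one file to .docx."""
    return [
        soffice,
        "--headless",            # run without a GUI
        "--convert-to", "docx",  # target format
        "--outdir", str(out_dir),
        str(src),
    ]


def convert_keep_name(src, out_dir, runner=subprocess.run):
    """Convert src to .docx, then give the result the original file name.

    out_dir should differ from src's own directory, otherwise the final
    rename overwrites the original file.
    """
    src, out_dir = Path(src), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    runner(build_convert_command(src, out_dir), check=True)
    produced = out_dir / (src.stem + ".docx")  # LibreOffice names it <stem>.docx
    final = out_dir / src.name                 # e.g. LETTER.RTF, but .docx inside
    produced.replace(final)
    return final
```

Looping `convert_keep_name` over the `.RTF` files of a folder would give a batch run. Checking a handful of outputs by eye before replacing any originals seems prudent, given the comment in the log about a small share of documents converting badly.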
[13:58] <zxmpi> i've had situations like that and compressing works for 99% of the documents fine but there's 1% that when you compress them they become unreadable blurs
[13:59] <daftykins> heh, yeah the added challenge in this case is that i needed them to keep the original file name *and* pretend to be .RTF still so that they'd open from the program in MS Word, thankfully it works fine and doesn't care they're not really RTFs
[14:00] <daftykins> the only quirk is it puts spelling error squiggles under loads of normal words, some kind of additions must have been made that make it go squirrely
[14:00] <zxmpi> yeah, windows hates file extensions and can ignore them for some apps
[15:52] <penguin42> daftykins: Sorry, I'm unclear; are you resaving them as rtf or plain txt?
[15:52] <daftykins> ah i did neglect that part yes, well since i need to preserve their opening, they're noew .docx *but* named .RTF :D
[15:52] <daftykins> *now
[15:53] <daftykins> developer refuses to offer any assistance in editing the database to reflect a file extension change
[15:53] <daftykins> they seem to write into a flat file format with pairs of files named .FS5 and .IDX
[15:53] <penguin42> had you tried just resaving them back as rtf?
[15:53] <daftykins> (in case you've ever heard of those)
[15:54] <penguin42> i.e. whether LO's RTF writer is any more concise?
[15:54] <daftykins> yeah that drops them to a smaller size, but still 3x larger than .docx
[15:54] <penguin42> yeh, docx is gzip'd (or is it zip)
[15:55] <daftykins> mm zip xml aiui
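On the zip question: a .docx is a ZIP container holding XML parts, while a real RTF file is plain text starting with `{\rtf`. A small sketch of how one could check what a file named .RTF actually contains by looking at its first bytes; the function name `sniff_format` is made up for illustration.

```python
import io
import zipfile


def sniff_format(data):
    """Guess the container format from the leading bytes of a file."""
    if data.startswith(b"{\\rtf"):
        return "rtf"   # genuine RTF: plain-text markup
    if data.startswith(b"PK\x03\x04"):
        return "zip"   # ZIP local file header, as used by .docx
    return "unknown"


def make_tiny_zip():
    """Build a one-member ZIP archive in memory, purely for demonstration."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("word/document.xml", "<w:document/>")
    return buf.getvalue()
```

Reading the first few bytes of a converted file, for example `open(path, "rb").read(8)`, and passing them to `sniff_format` should report `"zip"` for the renamed .docx files and `"rtf"` for untouched originals.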
[15:55] <daftykins> so 6,958 KB RTF, 500 KB .docx, 1,462 KB .RTF resaved
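For reference, the three sizes quoted for that one file work out as follows. This is plain arithmetic on the figures from the log, and the variable names are only illustrative.

```python
# Sizes in KB, as quoted in the log for a single document.
original_rtf_kb = 6958
docx_kb = 500
resaved_rtf_kb = 1462

# Size of each conversion as a percentage of the original RTF.
docx_percent = 100 * docx_kb / original_rtf_kb
resaved_percent = 100 * resaved_rtf_kb / original_rtf_kb

# How much larger the resaved RTF is than the .docx.
resaved_vs_docx = resaved_rtf_kb / docx_kb

print(f".docx is {docx_percent:.2f}% of the original")
print(f"resaved RTF is {resaved_percent:.2f}% of the original")
print(f"resaved RTF is {resaved_vs_docx:.2f}x the .docx")
```

That puts the .docx at roughly 7% of the original and the resaved RTF at roughly 21%, a little under three times the .docx, which matches the "3x larger" remark.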
[15:56] <penguin42> I wonder if you can tell LO to change the language in the docx so it doesn't try and spell check
[15:57] <daftykins> it's an odd one, normal words come up as bad and yet i see English UK at the bottom just fine
[15:58] <daftykins> with LO that is, didn't spend much time in MS Word on their end
[15:58] <daftykins> i lose all the create/modified dates as well of course due to processing
[16:03] <penguin42> restamp those?
[16:03] <daftykins> i don't know a viable method for that off hand
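One possible way to restamp, sketched with Python's standard library: record each original file's access and modification times before conversion, then apply them to the converted file with `os.utime`. The helper names are hypothetical. This only covers access and modified times; the standard library has no portable way to set a creation date, so on Windows that part would need a different tool.

```python
import os


def snapshot_times(path):
    """Return (atime_ns, mtime_ns) for a file, to be reapplied later."""
    st = os.stat(path)
    return (st.st_atime_ns, st.st_mtime_ns)


def restore_times(path, times):
    """Apply a previously recorded (atime_ns, mtime_ns) pair to a file."""
    os.utime(path, ns=times)


def copy_timestamps(src, dst):
    """Copy access and modification times from src to dst."""
    restore_times(dst, snapshot_times(src))
```

If the converted file overwrites the original in place, `snapshot_times` has to be called before the conversion and `restore_times` afterwards. If originals and outputs live in separate folders, `copy_timestamps(original, converted)` can be run at any point.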
[16:04] * penguin42 wishes Toolstation's order system didn't blatantly lie and say it could process an order in 5mins
[16:33] <daftykins> https://www.twitch.tv/nasa
[16:33] <daftykins> eclipse over in New Mexico
[16:43] <zxmpi> the cosmic ballet goes on https://www.youtube.com/watch?v=FmoW-gNjjXA
[17:13] * penguin42 just walked 3x3m length of trunking home from Toolstation
[17:33] <daftykins> arms like jelly now?
[17:33] <penguin42> one, yes :-)
[17:34] <penguin42> didn't even get too many odd looks either :-)
[17:34] <daftykins> if anyone glances for too long at whatever i'm carrying, i like to say i'm taking it for a walk
