The Ur-Quan Masters Home Page Welcome, Guest. Please login or register.
Did you miss your activation email?
October 04, 2024, 04:27:56 pm
Home Help Search Login Register
News: Celebrating 30 years of Star Control 2 - The Ur-Quan Masters

+  The Ur-Quan Masters Discussion Forum
|-+  The Ur-Quan Masters Re-Release
| |-+  General UQM Discussion (Moderator: Death 999)
| | |-+  Translation update
« previous next »
Pages: [1] 2 Print
Author Topic: Translation update  (Read 4556 times)
meep-eep
Forum Admin
Enlightened
*****
Offline Offline

Posts: 2847



View Profile
Translation update
« on: July 04, 2005, 07:21:19 pm »

I've created a new page in the Ultronomicon for information on translation projects. I advice all those who are or want to be involved with such projects to read it and perhaps provide additions.
(Link).
Logged

“When Juffo-Wup is complete
when at last there is no Void, no Non
when the Creators return
then we can finally rest.”
MrSad
Zebranky food
*
Offline Offline

Gender: Male
Posts: 7


Virtual Life Form


View Profile WWW
Re: Translation update
« Reply #1 on: July 05, 2005, 02:50:13 pm »

Sorry for possible offtopic, but I couldn't found the answer in the link you provided.
How to force UQM show symbols with numbers 128-255?
I copied 65.png (latin 'A') to 192.png (russian 'A') to same directory (\comm\comandr\comandr.fon\), saved it to uqm-0.4.0-content.uqm file. And put some 'A' letters in the dialog. But when I talk to starbase commander the text shows incorrectly.
Logged

Virtually yours, Mr.Sad
Novus
Enlightened
*****
Offline Offline

Gender: Male
Posts: 1938


Fot or not?


View Profile
Re: Translation update
« Reply #2 on: July 05, 2005, 03:41:21 pm »

Quote
Sorry for possible offtopic, but I couldn't found the answer in the link you provided.
How to force UQM show symbols with numbers 128-255?
I copied 65.png (latin 'A') to 192.png (russian 'A') to same directory (\comm\comandr\comandr.fon\), saved it to uqm-0.4.0-content.uqm file. And put some 'A' letters in the dialog. But when I talk to starbase commander the text shows incorrectly.

I'm assuming you're using UQM 0.4.

Judging from your example, you're assuming the text is in Windows-1251 while UQM (according to the Ultronomicon) uses UTF-8. As UTF-8 is a variable-length encoding and uses byte values from 128 to 255 for multi-byte characters, putting the byte 192 in a file and expecting to get the character 192 shown in UQM does not work. Convert your text file to UTF-8 and change the file names of the additional characters to correspond to their Unicode values.

Interestingly, the Finnish and German translations both seem to be in ISO-8859-1 (Latin 1), as they are apparently based on UQM 0.3 with the fix for chars above 136.
Logged

RTFM = Read the fine manual.
RTTFAQ = Read the Ur-Quan Masters Technical FAQ.
MrSad
Zebranky food
*
Offline Offline

Gender: Male
Posts: 7


Virtual Life Form


View Profile WWW
Re: Translation update
« Reply #3 on: July 05, 2005, 05:17:07 pm »

I don't know what editor uses unicode. FAR, notepad and MS Word can't. I could write a converter, but U0400.pdf that describes cyrillic codes is password protected. It seems that everything is done to make translation process impossible.

I see only way to solve this: change symbols 65-127 to russian. To get rid of every latin symbol in game. That's what I call "Per anus ad astra"...

P.S. I've translated many games from english to russian. Civilization 3 and Warhammer were the largest of them with hundreds kilobytes of texts. Help me with fonts and formats and I promise there will be a russian version of UQM.
« Last Edit: July 05, 2005, 06:08:56 pm by MrSad » Logged

Virtually yours, Mr.Sad
meep-eep
Forum Admin
Enlightened
*****
Offline Offline

Posts: 2847



View Profile
Re: Translation update
« Reply #4 on: July 05, 2005, 06:32:47 pm »

Modern versions of Notepad should be able to save files as UTF-8. And there are plenty of other editors and existing conversion tools that may help you. Google is your friend.
http://www.jedit.org/ looks promising.

The .pdf unicode tables from unicode.org are not password protected. If you get a message that "your viewer cannot decrypt the file" you're using a version of Acrobat Reader that's too old. The solution is obvious.

As for "It seems that everything is done to make translation process impossible.", I'm not going to bite.

As you can read on the Ultronomicon page I linked to, a Russian version is already underway. You may want to join that team instead of doing double work.
Logged

“When Juffo-Wup is complete
when at last there is no Void, no Non
when the Creators return
then we can finally rest.”
Novus
Enlightened
*****
Offline Offline

Gender: Male
Posts: 1938


Fot or not?


View Profile
Re: Translation update
« Reply #5 on: July 05, 2005, 08:54:44 pm »

Quote
I don't know what editor uses unicode. FAR, notepad and MS Word can't.

I like Emacs myself, but it's a bit... idiosyncratic. But even Notepad can save in UTF-8.
Logged

RTFM = Read the fine manual.
RTTFAQ = Read the Ur-Quan Masters Technical FAQ.
GDP_Spaceman
Zebranky food
*
Offline Offline

Gender: Male
Posts: 23


SC2 for ever!!


View Profile WWW
Re: Translation update
« Reply #6 on: July 11, 2005, 10:48:34 pm »

Quote
How to force UQM show symbols with numbers 128-255?
I fixed the problem by add few line in the function "getCharFromString"
(src/sc2code/strings/unicode.c)
Logged

It's time for a german SC2 Smiley
Project page at http://www.gdp-gameware.de
Novus
Enlightened
*****
Offline Offline

Gender: Male
Posts: 1938


Fot or not?


View Profile
Re: Translation update
« Reply #7 on: July 12, 2005, 03:13:10 pm »

Quote
How to force UQM show symbols with numbers 128-255?
I fixed the problem by add few line in the function "getCharFromString"
(src/sc2code/strings/unicode.c)
If by "fixed", you mean "changed UTF-8 handling in such a way that the ISO Latin-1 codes for ä, ö and ü (and the same in upper case) are passed straight through the decoder, causing lots of seemingly random corruption in proper UTF-8 text in a wide variety of languages", I agree. Instead of converting the text files to UTF-8, you hack UQM to make it display your Latin-1 text despite the best efforts of the UQM core team to move to Unicode so that translation projects don't need to modify UQM. You've missed the point of using Unicode completely, just like our Russian friend.

Why does everyone seem to have such a problem with UTF-8? Every editor I use under XP and Linux supports UTF-8 properly, and most of the time, UTF-8 files are even autodetected.

Sorry about the rant; I don't mean to offend the German translation crew (thanks for the source release, BTW; I was curious about how you were handling character set issues). However, I think the translators are increasing their own burden by forking UQM unnecessarily. Translations should be drop-in content replacement packs, not complete rewrites! This makes life a lot easier both for the maintainers of the translations and the users.
Logged

RTFM = Read the fine manual.
RTTFAQ = Read the Ur-Quan Masters Technical FAQ.
GDP_Spaceman
Zebranky food
*
Offline Offline

Gender: Male
Posts: 23


SC2 for ever!!


View Profile WWW
Re: Translation update
« Reply #8 on: July 13, 2005, 01:00:14 am »

I must hack UQM in any case, because to translate the hard-coded text.

How can i convert the files to UTF-8 ?
Which font is a UTF-8 font?
Logged

It's time for a german SC2 Smiley
Project page at http://www.gdp-gameware.de
meep-eep
Forum Admin
Enlightened
*****
Offline Offline

Posts: 2847



View Profile
Re: Translation update
« Reply #9 on: July 13, 2005, 01:05:14 am »

See this posting.
Logged

“When Juffo-Wup is complete
when at last there is no Void, no Non
when the Creators return
then we can finally rest.”
Novus
Enlightened
*****
Offline Offline

Gender: Male
Posts: 1938


Fot or not?


View Profile
Re: Translation update
« Reply #10 on: July 13, 2005, 01:37:36 am »

I must hack UQM in any case, because to translate the hard-coded text.
True, but this will probably change before 1.0. The long-term goal is to use the same executable for all translations.

Quote
How can i convert the files to UTF-8 ?
Which font is a UTF-8 font?
In modern Windows versions, Unicode is used internally. Essentially, this means that most fonts are Unicode fonts, although few of them contain the entire character set. The same applies to most of the relevant Unix libraries. Unicode is, loosely speaking, a 21-bit character set as opposed to the usual 8-bit character sets, and supports every known written language short of Klingon.

Unicode can be encoded in several different ways. UTF-32 is a straight-forward but very inefficient way of encoding Unicode (represent each 21-bit value as a 32-bit word). UTF-8 encodes 0 to 127 as single bytes (making all 7-bit ASCII text valid UTF-8) and uses the values 128-255 to indicate multi-byte characters (values of 128 and above). You could consider UTF-8 to be a compressed form of Unicode and UTF-32 to be the "raw" encoding.

Most modern editors can save as UTF-8. In Windows XP Notepad, the character encoding is an option in the "Save As..." dialogue box.

The reason I'm surprised that saving to UTF-8 is a problem is that the Linux distribution I use has been defaulting to UTF-8 for about three years.

GNU recode is a convenient character set conversion utility.
Logged

RTFM = Read the fine manual.
RTTFAQ = Read the Ur-Quan Masters Technical FAQ.
GDP_Spaceman
Zebranky food
*
Offline Offline

Gender: Male
Posts: 23


SC2 for ever!!


View Profile WWW
Re: Translation update
« Reply #11 on: July 13, 2005, 10:50:27 pm »

Quote
option in the "Save As..." dialogue box.
Thats it  Grin

I convert the files if  the first UQM with outsourced text is finished.
Logged

It's time for a german SC2 Smiley
Project page at http://www.gdp-gameware.de
McLay
Guest


Email
Re: Translation update
« Reply #12 on: July 15, 2005, 03:28:58 pm »

Hi. I have whole translated text to russia, but collided with some problems with fixed length strings. This is a months abbreviations(FEB, JAN...), string "ON" and "OFF", "Enter name"...and maybe some other.

As far as i know, cyrillic char has a two byte len, and english has one. So, i can translate "FEB" to string with one cyr char and one eng char Smiley

Do you planning to fix this errors? If yes, ill try to report about other similar problems.


ps. sorry about my english. text translation was made by other peoples Wink
Logged
meep-eep
Forum Admin
Enlightened
*****
Offline Offline

Posts: 2847



View Profile
Re: Translation update
« Reply #13 on: July 15, 2005, 06:08:28 pm »

I just fixed the issue with months in cvs. But what's wrong with ON/OFF and "Enter Name"?

Also read this.


Edit; Ok, I see the problem with "Enter Name" (an input issue rather than an output issue).
That leaves ON/OFF. What's wrong with it?
« Last Edit: July 15, 2005, 07:50:22 pm by meep-eep » Logged

“When Juffo-Wup is complete
when at last there is no Void, no Non
when the Creators return
then we can finally rest.”
McLay
Zebranky food
*
Offline Offline

Gender: Male
Posts: 3



View Profile
Re: Translation update
« Reply #14 on: July 15, 2005, 10:23:21 pm »

Ups, sorry, it was bug with "Cyborg" string (not ON/OFF).
If translated(to cyrillic) 'Cyborg' string len = 6, then game crashed.

Quote
Also read this.
Ok.
Logged
Pages: [1] 2 Print 
« previous next »
Jump to:  


Login with username, password and session length

Powered by MySQL Powered by PHP Powered by SMF 1.1.21 | SMF © 2015, Simple Machines Valid XHTML 1.0! Valid CSS!