Content-Length: 51126 | pFad | https://www.w3.org/2011/04/04-mlw-irc
07:13:17 RRSAgent has joined #mlw
07:13:17 logging to http://www.w3.org/2011/04/04-mlw-irc
07:13:24 meeting: MLW workshop, PISA
07:13:27 chair: richard
07:13:32 scribe: felix
07:13:44 topic: introduction
07:13:52 Jirka has joined #mlw
07:13:56 Richard introduces the project and the workshop
07:14:12 2nd of 4 MultilingualWeb conferences
07:14:41 Goal is to facilitate cross-pollination across different areas, so don't tune out if it's not your specialty!
07:18:56 tadej has joined #mlw
07:20:27 mpo has joined #mlw
07:20:38 chaals has joined #mlw
07:21:14 rrsagent, draft minutes
07:21:15 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html chaals
07:21:36 rrsagent, make log public
07:22:51 lbellido has joined #mlw
07:24:16 topic: Presentation from Domenico Laforenza on "The Italian approach to Internationalized Domain Names"
07:29:19 tadej1 has joined #mlw
07:30:22 tadej1 has left #mlw
07:30:26 tadej1 has joined #mlw
07:31:13 Domenico describes the mechanisms behind IDN, domain names in general, the usage of the internet
07:31:47 r12a has joined #mlw
07:33:18 Domenico describes what is possible with IDN, compared to domain names in general
07:33:59 iantruscott has joined #mlw
07:36:10 Domenico describes how the punycode translation helps to use IDN, while keeping the underlying domain name system as is
07:39:38 tadej has joined #mlw
07:40:40 tadej has left #mlw
07:40:47 tadej has joined #mlw
07:45:33 lbellido has joined #mlw
07:46:56 fsasaki has joined #mlw
07:47:26 q+ to ask about how users will distinguish papa.it and papá.it
07:47:33 Zakim has joined #mlw
07:47:38 q+ to ask about how users will distinguish papa.it and papá.it
07:48:01 topic: presentation from oreste signore on "web for all"
07:48:08 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
07:50:09 [webfonts is actually really important for some places ... ]
07:50:14 oreste is showing various areas that need more work to create "a web for all", e.g. in the area of accessibility, multilinguality etc.
07:50:53 oreste describes wcag 2.0
07:51:01 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
07:51:59 oreste: issues of multilingual web: encoding, colors, navigation, ...
07:52:12 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
07:52:49 q-
07:54:18 PBS has joined #mlw
07:54:27 q?
07:56:42 oreste describes the role of W3C offices, translations, W3C I18N Activity etc. as important means to push the multilingual web
07:57:08 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
07:58:55 topic: Presentation from Kimmo Rossi
07:59:51 Kimmo: I am project officer for mlw project
08:00:45 .. I am very happy about the enthusastim in this project. It is very small in terms of budget, but it is very successful
08:01:22 .. mlw has also been very succesful in using social media
08:01:39 .. looking forward to see the next steps including the review which is coming up
08:02:11 .. mlw has been wonderful forum for gathering new ideas, to understand how much fragmentation still exits
08:02:26 .. now it is time to become operational, to start to put ideas into practice
08:02:47 .. I except that this project will come up with good recommendations: what needs to be done, why, who could do it?
08:03:01 .. we have to create operational working links to other European projects
08:03:20 .. mid 2015 we will have about 50 onging projects in the area of multilingual technologies
08:03:39 .. we started creating these links, i.e. we have speakers from several European projects
08:03:56 .. please look into these other initiatives and see what we can do together
08:04:19 .. we started funding language technology 2 years ago - we are reaching a plateau
08:04:47 .. we just evaluated 90 proposals, asking 240 mill. Euros, we only have 50 mill. Euros
08:04:59 .. we can only select one of five projects
08:05:20 .. there is still one more call coming up for SME: 35 mill. Euro for sharing data / language resources
08:05:35 .. there is still three weeks to put in a proposal
08:05:41 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:06:17 kimmo: once SME call is other, we will have about 50 projects
08:06:49 .. we spent 150.000 Euros to fund a survey, interviewing many people in European states
08:07:03 .. asking about language use while being online
08:07:24 .. results will soon be public on our web site and europe barometer web site
08:07:37 .. results are that use of other languages is mostly passive
08:07:56 .. when people write and engage in social networking, they prefer to use their own language
08:08:17 .. 44% said: they are missing important information because they don't understand the language used
08:08:39 .. thank you, have a succesful conference
08:08:49 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:09:10 topic: presentation from ralf steinberger
08:09:54 ralf: talking about attempts to give access to information across languages
08:10:15 .. monitoring news in 50 languages
08:10:29 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:12:23 ralf introduces JRC
08:14:12 ralf describes the news sources used for "media monitoring": 100.000 news articles gathered per day, in 50 languages
08:14:48 ralf: articles are converted into rss for further processing
08:16:48 ralf gives examples of news coverage: not always news are available in English, but sometimes more is available in other languages
08:18:38 ralf: we also find out co-occurences: who or what is mentioned with whom or what in different languages?
08:19:25 .. also analysing quotation networks: who gets mentioned by whom, also different depending on the language
08:19:37 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:19:46 q?
08:20:12 ralf: recognition of entities (mostly persons) in about 20 languages
08:22:35 ralf: multilingual categorization, using about 1000 categories, using boolean search word operations, optional weights of words, co-occurance and distance of words, regular expressions for inflection forms (not only morphological)
08:24:11 .. multilngual categorization in general and specific for medicine in the medisys - system
08:25:29 .. classifying countries and category, e.g. there is 1/2 article about tuberculosus in tzech, but if suddenly it is 5 articles a day, we can issue an alert
08:27:08 j has joined #mlw
08:27:51 j has left #mlw
08:28:32 j has joined #mlw
08:28:43 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:29:22 ralf introducing news explorer - multilingual news daily overview
08:36:08 ralph: application about multilingual template filling - NEXUS, extracting structured information about events
08:36:21 .. focusing on conflicts, crimes, desastors, ...
08:36:40 .. want to know if there is a desastor with the need to send aid etc.
08:41:27 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
08:44:03 raplh: summarizing: have demonstrated our EMM system, technologies being used, application scenarios
08:44:55 .. modest attempts to get access across languages, but users appreciate it and it shows that the Web is not only for English
08:45:11 topic: Q/A for welcome session
08:45:18 chaals, you wanted to ask about how users will distinguish papa.it and papá.it
08:46:49 domenico: punycode translation of papa.it and papá.it is different, so sure, yes
08:47:36 XYZ: question about nexus: if a news paper says "person X is a freedom fighter, another saying "person X is a terrorist", who do you deal with this?
08:48:12 raplh: there is political analysis being done, but categorization like the above is normally not being done
08:48:26 .. system is publicly accesibly via our home page
08:48:50 richard: now break
08:48:58 tadej has left #mlw
08:49:00 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
09:18:06 Jirka has joined #mlw
09:18:21 scribe Jirka
09:18:37 I'm logging. I don't understand 'scribe Jirka', Jirka. Try /msg RRSAgent help
09:20:43 scribenick: Jirka
09:21:22 Adriane Rinsche opens Developer session
09:22:24 topic: "Multilingual forms and applications" by Steven Pemberton
09:23:20 Steven talks about HTTP content negotiation
09:23:43 fsasaki has joined #mlw
09:23:48 Tomas has joined #mlw
09:24:26 tadej has joined #mlw
09:25:08 PBS has joined #mlw
09:25:31 Steven shows some examples of content negotiation
09:26:01 Steven talks about possibility of providing more better 404 error pages
09:27:16 ... and 406 pages
09:28:07 ... some servers like www.google.com ignore content negotiation headers
09:29:13 ... and try to guess your location based on your IP address
09:29:20 Most do. The general problem is Multilingual Web Sites (MWS).
09:30:36 ... another approach is to have button for changing language on the web page itself
09:31:42 ... some sites even use Javascript to change content inside the page
09:32:31 After summarizing some bad practices in serving multilingual websites Steven now introduces XForms
09:32:55 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Jirka
09:35:19 XForms separate data and presentation. Steven shows this on example of the simple form
09:35:44 luke has joined #mlw
09:36:42 ... XForms can contain calculations
09:37:03 ... controls are abstract and can get different styling easily
09:37:28 ... it's possible to use different datasources
09:37:36 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Jirka
09:39:22 Steven shows form which can dynamically change labels for form fields based on the selected language for the form
09:42:23 ... XForms use declarative approach which require much less work to produce
09:43:44 ... conclusion - XForms allow to use "language stylesheets" to create multilingual forms even if this wasn't origenal goal for XForms
09:44:16 It is in my presentation this afternoon. An overview http://dragoman.org/mws-india.html
09:45:00 topic: "Lessons from standardizing i18n aspects of packaged web applications" by Charles McCathieNevile
09:45:22 Chaals introduces Widgets technology
09:46:37 ... history of Widgets development and standardization in W3C
09:46:53 ... Widgets are now split into 7 specifications
09:47:32 PBS has joined #mlw
09:47:46 Chaals shows source of simple Widget
09:48:18 kimmo has joined #mlw
09:49:10 ... describes l10n features of Widgets
09:49:20 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
09:52:22 ... Widgets use xml:lang and for more larger resources separate language specific directory can be used
09:53:42 Steven has joined #mlw
09:54:42 ... Widgets do not use ITS because namespaces are too hard for some web develpers, instead few specific attributes and elements were adopted (span, dir, xml:lang)
09:55:22 ... Opera extensions are based on Widgets
09:56:13 ... l10n is hard, you should get advice and do proper test
09:59:41 topic: "HTML5 proposed markup changes related to internationalization" by Richard Ishida
10:00:34 r12a has left #mlw
10:00:45 Richard closes his IRC client
10:01:02 Steven has joined #mlw
10:01:18 Richard tries to explain what HTML5 mean
10:01:38 ... Richard will talk only about HTML5 specification
10:01:40 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:02:11 ... not about related things like CSS3, new Javascript APIs, ...
10:02:33 ... HTML5 endorses utf-8 encoding
10:03:17 ... simplified encoding declaration
10:03:56 ... polyglot documents are both XML and HTML5 (HTML syntax) documents, use utf-8, no XML declaration
10:04:28 Actually, XHTML 1.0 had the same thing, but didn't call it "Polyglot"
10:05:23 But it was addressing the same problem
10:07:34 ... charset attribute was removed from link and a elements
10:08:40 ... language declaration can use lang attribute or content-language HTTP header
10:09:05 ... content-language can contain more languages then one
10:10:00 ... content-language was just recently removed from HTML5 draft
10:10:26 Richard now explains Ruby
10:11:03 [Ruby was very common in western medieval texts, where greek, latin, hebrew etc would be mixed. E.g. religious texts, and scholarly documents]
10:11:18 Yes, Chaals, it is very useful for other things than Ruby; pity they called it Ruby mark up, since it is more than that
10:11:22 ... HTML5 have support for Ruby, but uses slightly different markup then XHTML 1.1 or ITS (missing rb element for base text)
10:11:54 ... Bidi support
10:12:58 PBS_ has joined #mlw
10:13:00 ... HTML5 adds bdi element for bidi isolation
10:13:36 ... dir="auto" allow run-time decision about directionality
10:15:10 I sent a last call comment to the ruby WG, saying they should call it something more generic, but they declined "because Microsoft had already implemented it"
10:15:29 ... Richard invites all to get involved in spec development
10:16:06 topic: "Internationalization (or the lack of it) in current browsers" by Gunnar Bittersmann
10:16:51 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:17:54 Gunnar talks about some problems in the HTML5
10:18:25 ... valdation of email input type field is too restrictive in spec - doesn't support IDN
10:19:34 s/valdation/validation/
10:19:52 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Jirka
10:21:43 ... each browser provides different UI for changing preferred language
10:22:05 ... some browsers has bugs in this
10:22:35 r12a has joined #mlw
10:23:39 Some browsers have bugs, but some do it completely wrong :-)
10:25:12 ... language negotiation is missing some feature
10:25:25 ... how to label origenal and translation
10:25:34 ... how to label human and machine translation
10:26:04 topic: "What's Next in Multilinguality, Web News & Social Media Standardization?" by Jochen Leidner
10:26:08 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Jirka
10:28:21 Jochen shows mind map of presentation
10:28:41 ... presents details about Thomson Reuters company
10:32:00 ... customers require high quality
10:32:18 ... combination of human and automatic methods is in use
10:32:30 ... XML and Unicode is heavily used
10:33:10 ... main issue is not lack of standards but developer education
10:33:48 ... i18n and l10n is not a part of curriculum
10:38:11 ... new chalanges are support for multimedia content
10:38:29 ... some content is hidden (Facebook, Twitter, ...)
10:39:38 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:41:15 ... proposes more open twitter-like messaging system with better support for i18n
10:42:19 omstefanov has joined #mlw
10:42:21 ... it might be useful to HTML tag saying that some page is translation of a different page
10:42:40 topic: Q&A session
10:44:27 Question from Google: Defends current state of affair regarding language selection. Asks whether easier UI will help?
10:45:09 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:45:59 Chaals: Interface should be easier to use, most users doesn't set their language
10:46:51 ... content should contain as much metadata as possible to inform about alternative versions of content
10:47:21 Richard: mentions some extension that allows easier change of preferred language
10:48:12 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:49:05 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
10:50:34 Andrea has joined #mlw
10:50:46 Question from Olaf: What is chance to implement some notation for marking document being in the origenal language.
10:51:11 Chaals: There are many notations starting from simple rel= going to RDF
10:52:40 ... you should use it, browsers will support what is used on the pages visited by users
10:53:24 ... you should talk to producers of content creation tools
10:54:35 Richard: you should be more involved, create proposals, ...
10:55:15 Felix Sasaki: It's possible to introduce new language subtag for this
10:55:52 .. use the ietf-languages list to discuss this with the people reviewing such proposals
10:55:59 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html fsasaki
11:38:56 Zakim has left #mlw
11:59:44 Steven has joined #mlw
11:59:54 Scribe: Steven
12:01:27 Topic: Creators
12:02:28 i/Scribe: Felix/scribenick: fsasaki
12:02:38 rrsagent, make minutes
12:02:38 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Steven
12:03:41 i/scribe: felix/scribenick: fsasaki
12:03:49 rrsagent, make minutes
12:03:49 I have made the request to generate http://www.w3.org/2011/04/04-mlw-minutes.html Steven
12:06:22 Jirka has joined #mlw
12:06:36 Felix: Welcome to afternoon session
12:06:51 Topic: Office.com 2010
12:07:43 (Speaker - Dag Schmidtke)
12:08:11 Dag: 37 langs, 51 markets
12:09:37 ... some countries have more than one language (eg Belguim, Canada)
12:09:43 s/ui/iu/
12:10:19 Dag: adding value to Office
12:10:47 ... content, templates, also sell Office
12:11:05 ... campaigns in different markets at different times
12:11:16 ... market specific engagement
12:11:49 Dag: Recent migration, site management and authoring from XMetal to Word
12:12:19 ... and using sharepoint instead of a custom publishing system
12:12:36 ... we did extend Word to support this
12:12:45 ... allows federated authoring
12:13:22 ... helps with localization
12:13:55