IBM : Develops Full-text Digitization System for National Diet Library of Japan

08/01/2011 | 12:35pm EDT

Tokyo, Japan - 01 Aug 2011: IBM (NYSE: IBM) today announced that it is helping the National Diet Library of Japan -- the country's only national library -- digitize its literary artifacts on a massive scale to make them widely available and searchable online by all information seekers.

The prototype technology, created by IBM Research, speeds full-text digitization of Japanese literature through expansive recognition of Japanese characters and by enabling users to collaboratively review and correct characters, script and structure. Additionally, the full-text digitization system is designed to promote future international collaboration and standardization among libraries around the world.

“Nearly two decades ago in his book Digital Library, Dr. Makoto Nagao, the director of the National Diet Library, shared his vision that digitized and structured electronic books will dramatically change the role of libraries and the way knowledge will be shared and reused in our society,” said Dr. Hironobu Takagi, who led the development of the prototype technology at IBM Research – Tokyo. “Until now, the breadth of the characters and expressions within the Japanese language had posed a series of challenges to massive digitization. In order to enable this transfer of knowledge from print to online, we realized the need for both machine and human intelligence to understand information in every form.”

Compared with languages that rely on just a few dozen alphabetic characters, Japanese is extremely diverse in terms of script. Besides the hiragana and katakana syllabaries, Japanese includes about 10,000 kanji characters (including old characters, variants and 2,136 commonly used characters), as well as ruby (small syllabary characters printed next to a kanji as a reading aid) and mixed vertical and horizontal text.

Aside from ensuring quality recognition of Japanese characters, IBM researchers aimed to reduce the time needed to review and verify the accuracy of the digitized texts. By introducing unique collaborative tools via crowdsourcing, the technology allows many users to quickly pore over the texts and make corrections at a much higher rate of productivity and efficiency.

“Through collaboration technology and user tools, we now have the potential to populate a global collection of literature and information,” said Dr. Takagi. “From small community libraries to national institutions, people everywhere can leverage this standardized system to help preserve and share their cultural works for years to come.”

The architecture of the full-text digitization prototype system provides the following two key collaborative features:

  • Collaborative Correction -- This feature allows simultaneous corrections by multiple users via web browsers and improves the accuracy of optical character recognition (OCR). The collaborative character correction interface displays a group of OCR-scanned characters which were recognized as the same character, allowing reviewers to make a batch correction. The result of the correction work can be fed into the OCR engine as training data, so the system can learn from the mistakes to improve accuracy down the line.
  • Collaborative Data Structuring -- This feature is essential to digitizing texts so that visually impaired people can read books using a voice browser. Reviewers are able to add structural information as well as correct read-out order, and both tasks are supported by inference engines to reduce workload. For document structuring, the system automatically detects repetitive patterns and suggests similar structures throughout pages. To correct read-out order, the visual editor technology uses smoothly connecting arrows to show in what order voice browsers would present content, allowing reviewers to go over reading flow at a glance and make necessary edits by simply dragging, dropping and re-arranging the arrows.
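The batch-correction idea behind Collaborative Correction can be illustrated with a minimal Python sketch. It is not IBM's implementation: the `Glyph` record, the function names and the sample characters are all hypothetical, chosen only to show how grouping glyphs by their recognized character lets a reviewer fix several misreads in one step and how those fixes could be collected as training data.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class Glyph:
    """One scanned character image (hypothetical record, not IBM's data model)."""
    glyph_id: int                    # identifier of the scanned character image
    recognized: str                  # character the OCR engine guessed
    corrected: Optional[str] = None  # reviewer's correction, if any


def group_by_recognized(glyphs):
    """Group glyphs that the OCR engine recognized as the same character."""
    groups = defaultdict(list)
    for glyph in glyphs:
        groups[glyph.recognized].append(glyph)
    return dict(groups)


def batch_correct(group, wrong_ids, correct_char):
    """Apply one correction to every glyph in the group that a reviewer flagged."""
    for glyph in group:
        if glyph.glyph_id in wrong_ids:
            glyph.corrected = correct_char


def training_pairs(glyphs):
    """Collect (glyph_id, corrected_label) pairs that could be fed back to OCR."""
    return [(g.glyph_id, g.corrected) for g in glyphs if g.corrected is not None]


# Assumed sample data: four glyphs, three of them read as the same kanji.
glyphs = [Glyph(1, "土"), Glyph(2, "土"), Glyph(3, "士"), Glyph(4, "土")]
groups = group_by_recognized(glyphs)

# A reviewer looks at everything read as "土" and flags two misreads at once.
batch_correct(groups["土"], wrong_ids={2, 4}, correct_char="士")
print(training_pairs(glyphs))
```

Showing look-alike glyphs side by side is what makes the odd ones stand out, so a single reviewer action can cover many occurrences across a book.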

The full-text digitization prototype system builds on two streams of technology. IBM researchers in Tokyo applied an innovative approach called Social Accessibility, which allows large groups of reviewers to work collaboratively via Web browsers regardless of location. In addition, the COoperative eNgine for Correction of ExtRacted Text (CONCERT) technology -- developed by IBM researchers in Haifa, Israel -- was leveraged to significantly improve productivity through the repetition of simple operations.
