Text Boundary Analysis - PowerPoint PPT Presentation

text boundary analysis n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Text Boundary Analysis PowerPoint Presentation
Download Presentation
Text Boundary Analysis

play fullscreen
1 / 154
Text Boundary Analysis
123 Views
Download Presentation
haley
Download Presentation

Text Boundary Analysis

- - - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

  1. Text Boundary Analysis Eric Mader Advisory Software Engineer IBM

  2. Where do I break lines? The rain in Spain stays mainly on the plain.

  3. Where do I break lines? The rain in Spain stays mainly on the plain. 您有坦率和誠實的聲譽。

  4. Where do I break lines? The rain in Spain stays mainly on the plain. 您有坦率和誠實的聲譽。 ด่ๅแรงฃนึ๓อัตราลูกจ้างใหม่ให้๓๕

  5. Even in English, this can be hard You owe me $1,234.56... I think.

  6. Even in English, this can be hard You owe me $1,234.56... I think.

  7. Word wrapping vs word selection Word wrapping: Some characters’ behavior is context-dependent.

  8. Word wrapping vs word selection Word wrapping: Some characters’ behavior is context-dependent. Searching by words: Some characters’ behavior is context-dependent.

  9. Analysis by pairs second ltr dgt sp pun ltr dgt first sp X X X pun

  10. Analysis by pairs second ltr dgt sp pun ltr dgt first sp X X X pun

  11. Analysis by pairs second ltr dgt sp pun - ltr dgt first sp X X X X pun X - X

  12. Analysis by pairs second ltr dgt sp pun - ltr dgt first sp X X X X pun X - X

  13. Analysis by pairs second ltr dgt sp pun - nbs ltr dgt first sp X X X X pun X - X nbs

  14. Analysis by pairs second ltr dgt sp pun - nbs ltr dgt first sp X X X X pun X - X nbs

  15. Analysis by pairs second ltr dgt sp pun - nbs kji X ltr X dgt first sp X X X X X X pun X X - X nbs X X X kji X X

  16. Where pairs break down A break position can depend on more than two characters: You owe me $1,234.56... I think.

  17. Where pairs break down A break position can depend on more than two characters: You owe me $1,234.56... I think. 4.5

  18. Where pairs break down A break position can depend on more than two characters: You owe me $1,234.56... I think. 6..

  19. Where pairs break down Sentence boundaries require even more lookahead: He asked, “How tall are you?” I’m about 6 ft. tall. “Wow!”

  20. Where pairs break down Sentence boundaries require even more lookahead: He asked, “How tall are you?” I’m about 6 ft. tall. “Wow!”

  21. Where pairs break down Sentence boundaries require even more lookahead: He asked, “How tall are you?” I’m about 6 ft. tall. “Wow!”

  22. Where pairs break down Sentence boundaries require even more lookahead: He asked, “How tall are you?” I’m about 6 ft. tall. “Wow!”

  23. Where pairs break down Sentence boundaries require even more lookahead: He asked, “How tall are you?” I’m about 6 ft. tall. “Wow!”

  24. An example • If not otherwise mentioned, each character is a “word” unto itself. • A run of letters constitutes a “word” and is kept together. Certain punctuation marks may appear inside a word, but only if they have a letter on each side. • A run of digits constitutes a “number” and is kept together. Certain punctuation marks may appear inside a number, but only if they have a digit on each side. In addition, a number may have certain optional prefix and suffix characters. • If a “word” and a “number” appear in succession with nothing between them, they’re kept together.

  25. The state-machine approach $ start A 0 % . ’

  26. The state-machine approach $ start A 0 % . ’

  27. The state-machine approach $ start A 0 % . ’

  28. The state-machine approach $ start A 0 % . ’

  29. The state-machine approach $ start A 0 % . ’

  30. The state-machine approach $ start A 0 % . ’

  31. The state-machine approach $ start A 0 % . ’

  32. The state-machine approach $ start A 0 % . ’

  33. The state-machine approach $ start A 0 % . ’

  34. The state-machine approach $1,234.56... $ start A 0 % . ’

  35. The state-machine approach $1,234.56... $ start A 0 % . ’

  36. The state-machine approach $1,234.56... $ start A 0 % . ’

  37. The state-machine approach $1,234.56... $ start A 0 % . ’

  38. The state-machine approach $1,234.56... $ start A 0 % . ’

  39. The state-machine approach $1,234.56... $ start A 0 % . ’

  40. The state-machine approach $1,234.56... $ start A 0 % . ’

  41. The state-machine approach $1,234.56... $ start A 0 % . ’

  42. The state-machine approach $1,234.56... $ start A 0 % . ’

  43. The state-machine approach $1,234.56... $ start A 0 % . ’

  44. The state-machine approach $1,234.56... $ start A 0 % . ’

  45. The state-machine approach $1,234.56... $ start A 0 % . ’

  46. The state-machine approach $1,234.56... $ start A 0 % . ’

  47. Limitations 1992–1996

  48. Limitations 1992–1996

  49. Limitations –1996

  50. Limitations 1992–1996