annotate src/sord-0.12.0/tests/UTF-8.ttl @ 53:e1712f7d74a4

Rebuild with DW2 exception handling to match Qt
author Chris Cannam
date Thu, 27 Oct 2016 10:26:57 +0100
parents c7265573341e
children
rev   line source
Chris@0 1 @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
Chris@0 2
Chris@0 3 <> rdfs:comment """
Chris@0 4 UTF-8 encoded sample plain-text file
Chris@0 5 ‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾
Chris@0 6
Chris@0 7 Markus Kuhn [ˈmaʳkʊs kuːn] <http://www.cl.cam.ac.uk/~mgk25/> — 2002-07-25
Chris@0 8
Chris@0 9
Chris@0 10 The ASCII compatible UTF-8 encoding used in this plain-text file
Chris@0 11 is defined in Unicode, ISO 10646-1, and RFC 2279.
Chris@0 12
Chris@0 13
Chris@0 14 Using Unicode/UTF-8, you can write in emails and source code things such as
Chris@0 15
Chris@0 16 Mathematics and sciences:
Chris@0 17
Chris@0 18 ∮ E⋅da = Q, n → ∞, ∑ f(i) = ∏ g(i), ⎧⎡⎛┌─────┐⎞⎤⎫
Chris@0 19 ⎪⎢⎜│a²+b³ ⎟⎥⎪
Chris@0 20 ∀x∈ℝ: ⌈x⌉ = −⌊−x⌋, α ∧ ¬β = ¬(¬α ∨ β), ⎪⎢⎜│───── ⎟⎥⎪
Chris@0 21 ⎪⎢⎜⎷ c₈ ⎟⎥⎪
Chris@0 22 ℕ ⊆ ℕ₀ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ, ⎨⎢⎜ ⎟⎥⎬
Chris@0 23 ⎪⎢⎜ ∞ ⎟⎥⎪
Chris@0 24 ⊥ < a ≠ b ≡ c ≤ d ≪ ⊤ ⇒ (⟦A⟧ ⇔ ⟪B⟫), ⎪⎢⎜ ⎲ ⎟⎥⎪
Chris@0 25 ⎪⎢⎜ ⎳aⁱ-bⁱ⎟⎥⎪
Chris@0 26 2H₂ + O₂ ⇌ 2H₂O, R = 4.7 kΩ, ⌀ 200 mm ⎩⎣⎝i=1 ⎠⎦⎭
Chris@0 27
Chris@0 28 Linguistics and dictionaries:
Chris@0 29
Chris@0 30 ði ıntəˈnæʃənəl fəˈnɛtık əsoʊsiˈeıʃn
Chris@0 31 Y [ˈʏpsilɔn], Yen [jɛn], Yoga [ˈjoːgɑ]
Chris@0 32
Chris@0 33 APL:
Chris@0 34
Chris@0 35 ((V⍳V)=⍳⍴V)/V←,V ⌷←⍳→⍴∆∇⊃‾⍎⍕⌈
Chris@0 36
Chris@0 37 Nicer typography in plain text files:
Chris@0 38
Chris@0 39 ╔══════════════════════════════════════════╗
Chris@0 40 ║ ║
Chris@0 41 ║ • ‘single’ and “double” quotes ║
Chris@0 42 ║ ║
Chris@0 43 ║ • Curly apostrophes: “We’ve been here” ║
Chris@0 44 ║ ║
Chris@0 45 ║ • Latin-1 apostrophe and accents: '´` ║
Chris@0 46 ║ ║
Chris@0 47 ║ • ‚deutsche‘ „Anführungszeichen“ ║
Chris@0 48 ║ ║
Chris@0 49 ║ • †, ‡, ‰, •, 3–4, —, −5/+5, ™, … ║
Chris@0 50 ║ ║
Chris@0 51 ║ • ASCII safety test: 1lI|, 0OD, 8B ║
Chris@0 52 ║ ╭─────────╮ ║
Chris@0 53 ║ • the euro symbol: │ 14.95 € │ ║
Chris@0 54 ║ ╰─────────╯ ║
Chris@0 55 ╚══════════════════════════════════════════╝
Chris@0 56
Chris@0 57 Combining characters:
Chris@0 58
Chris@0 59 STARGΛ̊TE SG-1, a = v̇ = r̈, a⃑ ⊥ b⃑
Chris@0 60
Chris@0 61 Greek (in Polytonic):
Chris@0 62
Chris@0 63 The Greek anthem:
Chris@0 64
Chris@0 65 Σὲ γνωρίζω ἀπὸ τὴν κόψη
Chris@0 66 τοῦ σπαθιοῦ τὴν τρομερή,
Chris@0 67 σὲ γνωρίζω ἀπὸ τὴν ὄψη
Chris@0 68 ποὺ μὲ βία μετράει τὴ γῆ.
Chris@0 69
Chris@0 70 ᾿Απ᾿ τὰ κόκκαλα βγαλμένη
Chris@0 71 τῶν ῾Ελλήνων τὰ ἱερά
Chris@0 72 καὶ σὰν πρῶτα ἀνδρειωμένη
Chris@0 73 χαῖρε, ὦ χαῖρε, ᾿Ελευθεριά!
Chris@0 74
Chris@0 75 From a speech of Demosthenes in the 4th century BC:
Chris@0 76
Chris@0 77 Οὐχὶ ταὐτὰ παρίσταταί μοι γιγνώσκειν, ὦ ἄνδρες ᾿Αθηναῖοι,
Chris@0 78 ὅταν τ᾿ εἰς τὰ πράγματα ἀποβλέψω καὶ ὅταν πρὸς τοὺς
Chris@0 79 λόγους οὓς ἀκούω· τοὺς μὲν γὰρ λόγους περὶ τοῦ
Chris@0 80 τιμωρήσασθαι Φίλιππον ὁρῶ γιγνομένους, τὰ δὲ πράγματ᾿
Chris@0 81 εἰς τοῦτο προήκοντα, ὥσθ᾿ ὅπως μὴ πεισόμεθ᾿ αὐτοὶ
Chris@0 82 πρότερον κακῶς σκέψασθαι δέον. οὐδέν οὖν ἄλλο μοι δοκοῦσιν
Chris@0 83 οἱ τὰ τοιαῦτα λέγοντες ἢ τὴν ὑπόθεσιν, περὶ ἧς βουλεύεσθαι,
Chris@0 84 οὐχὶ τὴν οὖσαν παριστάντες ὑμῖν ἁμαρτάνειν. ἐγὼ δέ, ὅτι μέν
Chris@0 85 ποτ᾿ ἐξῆν τῇ πόλει καὶ τὰ αὑτῆς ἔχειν ἀσφαλῶς καὶ Φίλιππον
Chris@0 86 τιμωρήσασθαι, καὶ μάλ᾿ ἀκριβῶς οἶδα· ἐπ᾿ ἐμοῦ γάρ, οὐ πάλαι
Chris@0 87 γέγονεν ταῦτ᾿ ἀμφότερα· νῦν μέντοι πέπεισμαι τοῦθ᾿ ἱκανὸν
Chris@0 88 προλαβεῖν ἡμῖν εἶναι τὴν πρώτην, ὅπως τοὺς συμμάχους
Chris@0 89 σώσομεν. ἐὰν γὰρ τοῦτο βεβαίως ὑπάρξῃ, τότε καὶ περὶ τοῦ
Chris@0 90 τίνα τιμωρήσεταί τις καὶ ὃν τρόπον ἐξέσται σκοπεῖν· πρὶν δὲ
Chris@0 91 τὴν ἀρχὴν ὀρθῶς ὑποθέσθαι, μάταιον ἡγοῦμαι περὶ τῆς
Chris@0 92 τελευτῆς ὁντινοῦν ποιεῖσθαι λόγον.
Chris@0 93
Chris@0 94 Δημοσθένους, Γ´ ᾿Ολυνθιακὸς
Chris@0 95
Chris@0 96 Georgian:
Chris@0 97
Chris@0 98 From a Unicode conference invitation:
Chris@0 99
Chris@0 100 გთხოვთ ახლავე გაიაროთ რეგისტრაცია Unicode-ის მეათე საერთაშორისო
Chris@0 101 კონფერენციაზე დასასწრებად, რომელიც გაიმართება 10-12 მარტს,
Chris@0 102 ქ. მაინცში, გერმანიაში. კონფერენცია შეჰკრებს ერთად მსოფლიოს
Chris@0 103 ექსპერტებს ისეთ დარგებში როგორიცაა ინტერნეტი და Unicode-ი,
Chris@0 104 ინტერნაციონალიზაცია და ლოკალიზაცია, Unicode-ის გამოყენება
Chris@0 105 ოპერაციულ სისტემებსა, და გამოყენებით პროგრამებში, შრიფტებში,
Chris@0 106 ტექსტების დამუშავებასა და მრავალენოვან კომპიუტერულ სისტემებში.
Chris@0 107
Chris@0 108 Russian:
Chris@0 109
Chris@0 110 From a Unicode conference invitation:
Chris@0 111
Chris@0 112 Зарегистрируйтесь сейчас на Десятую Международную Конференцию по
Chris@0 113 Unicode, которая состоится 10-12 марта 1997 года в Майнце в Германии.
Chris@0 114 Конференция соберет широкий круг экспертов по вопросам глобального
Chris@0 115 Интернета и Unicode, локализации и интернационализации, воплощению и
Chris@0 116 применению Unicode в различных операционных системах и программных
Chris@0 117 приложениях, шрифтах, верстке и многоязычных компьютерных системах.
Chris@0 118
Chris@0 119 Thai (UCS Level 2):
Chris@0 120
Chris@0 121 Excerpt from a poetry on The Romance of The Three Kingdoms (a Chinese
Chris@0 122 classic 'San Gua'):
Chris@0 123
Chris@0 124 [----------------------------|------------------------]
Chris@0 125 ๏ แผ่นดินฮั่นเสื่อมโทรมแสนสังเวช พระปกเกศกองบู๊กู้ขึ้นใหม่
Chris@0 126 สิบสองกษัตริย์ก่อนหน้าแลถัดไป สององค์ไซร้โง่เขลาเบาปัญญา
Chris@0 127 ทรงนับถือขันทีเป็นที่พึ่ง บ้านเมืองจึงวิปริตเป็นนักหนา
Chris@0 128 โฮจิ๋นเรียกทัพทั่วหัวเมืองมา หมายจะฆ่ามดชั่วตัวสำคัญ
Chris@0 129 เหมือนขับไสไล่เสือจากเคหา รับหมาป่าเข้ามาเลยอาสัญ
Chris@0 130 ฝ่ายอ้องอุ้นยุแยกให้แตกกัน ใช้สาวนั้นเป็นชนวนชื่นชวนใจ
Chris@0 131 พลันลิฉุยกุยกีกลับก่อเหตุ ช่างอาเพศจริงหนาฟ้าร้องไห้
Chris@0 132 ต้องรบราฆ่าฟันจนบรรลัย ฤๅหาใครค้ำชูกู้บรรลังก์ ฯ
Chris@0 133
Chris@0 134 (The above is a two-column text. If combining characters are handled
Chris@0 135 correctly, the lines of the second column should be aligned with the
Chris@0 136 | character above.)
Chris@0 137
Chris@0 138 Ethiopian:
Chris@0 139
Chris@0 140 Proverbs in the Amharic language:
Chris@0 141
Chris@0 142 ሰማይ አይታረስ ንጉሥ አይከሰስ።
Chris@0 143 ብላ ካለኝ እንደአባቴ በቆመጠኝ።
Chris@0 144 ጌጥ ያለቤቱ ቁምጥና ነው።
Chris@0 145 ደሀ በሕልሙ ቅቤ ባይጠጣ ንጣት በገደለው።
Chris@0 146 የአፍ ወለምታ በቅቤ አይታሽም።
Chris@0 147 አይጥ በበላ ዳዋ ተመታ።
Chris@0 148 ሲተረጉሙ ይደረግሙ።
Chris@0 149 ቀስ በቀስ፥ ዕንቁላል በእግሩ ይሄዳል።
Chris@0 150 ድር ቢያብር አንበሳ ያስር።
Chris@0 151 ሰው እንደቤቱ እንጅ እንደ ጉረቤቱ አይተዳደርም።
Chris@0 152 እግዜር የከፈተውን ጉሮሮ ሳይዘጋው አይድርም።
Chris@0 153 የጎረቤት ሌባ፥ ቢያዩት ይስቅ ባያዩት ያጠልቅ።
Chris@0 154 ሥራ ከመፍታት ልጄን ላፋታት።
Chris@0 155 ዓባይ ማደሪያ የለው፥ ግንድ ይዞ ይዞራል።
Chris@0 156 የእስላም አገሩ መካ የአሞራ አገሩ ዋርካ።
Chris@0 157 ተንጋሎ ቢተፉ ተመልሶ ባፉ።
Chris@0 158 ወዳጅህ ማር ቢሆን ጨርስህ አትላሰው።
Chris@0 159 እግርህን በፍራሽህ ልክ ዘርጋ።
Chris@0 160
Chris@0 161 Runes:
Chris@0 162
Chris@0 163 ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ ᚾᚩᚱᚦᚹᛖᚪᚱᛞᚢᛗ ᚹᛁᚦ ᚦᚪ ᚹᛖᛥᚫ
Chris@0 164
Chris@0 165 (Old English, which transcribed into Latin reads 'He cwaeth that he
Chris@0 166 bude thaem lande northweardum with tha Westsae.' and means 'He said
Chris@0 167 that he lived in the northern land near the Western Sea.')
Chris@0 168
Chris@0 169 Braille:
Chris@0 170
Chris@0 171 ⡌⠁⠧⠑ ⠼⠁⠒ ⡍⠜⠇⠑⠹⠰⠎ ⡣⠕⠌
Chris@0 172
Chris@0 173 ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠙⠑⠁⠙⠒ ⠞⠕ ⠃⠑⠛⠔ ⠺⠊⠹⠲ ⡹⠻⠑ ⠊⠎ ⠝⠕ ⠙⠳⠃⠞
Chris@0 174 ⠱⠁⠞⠑⠧⠻ ⠁⠃⠳⠞ ⠹⠁⠞⠲ ⡹⠑ ⠗⠑⠛⠊⠌⠻ ⠕⠋ ⠙⠊⠎ ⠃⠥⠗⠊⠁⠇ ⠺⠁⠎
Chris@0 175 ⠎⠊⠛⠝⠫ ⠃⠹ ⠹⠑ ⠊⠇⠻⠛⠹⠍⠁⠝⠂ ⠹⠑ ⠊⠇⠻⠅⠂ ⠹⠑ ⠥⠝⠙⠻⠞⠁⠅⠻⠂
Chris@0 176 ⠁⠝⠙ ⠹⠑ ⠡⠊⠑⠋ ⠍⠳⠗⠝⠻⠲ ⡎⠊⠗⠕⠕⠛⠑ ⠎⠊⠛⠝⠫ ⠊⠞⠲ ⡁⠝⠙
Chris@0 177 ⡎⠊⠗⠕⠕⠛⠑⠰⠎ ⠝⠁⠍⠑ ⠺⠁⠎ ⠛⠕⠕⠙ ⠥⠏⠕⠝ ⠰⡡⠁⠝⠛⠑⠂ ⠋⠕⠗ ⠁⠝⠹⠹⠔⠛ ⠙⠑
Chris@0 178 ⠡⠕⠎⠑ ⠞⠕ ⠏⠥⠞ ⠙⠊⠎ ⠙⠁⠝⠙ ⠞⠕⠲
Chris@0 179
Chris@0 180 ⡕⠇⠙ ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
Chris@0 181
Chris@0 182 ⡍⠔⠙⠖ ⡊ ⠙⠕⠝⠰⠞ ⠍⠑⠁⠝ ⠞⠕ ⠎⠁⠹ ⠹⠁⠞ ⡊ ⠅⠝⠪⠂ ⠕⠋ ⠍⠹
Chris@0 183 ⠪⠝ ⠅⠝⠪⠇⠫⠛⠑⠂ ⠱⠁⠞ ⠹⠻⠑ ⠊⠎ ⠏⠜⠞⠊⠊⠥⠇⠜⠇⠹ ⠙⠑⠁⠙ ⠁⠃⠳⠞
Chris@0 184 ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲ ⡊ ⠍⠊⠣⠞ ⠙⠁⠧⠑ ⠃⠑⠲ ⠔⠊⠇⠔⠫⠂ ⠍⠹⠎⠑⠇⠋⠂ ⠞⠕
Chris@0 185 ⠗⠑⠛⠜⠙ ⠁ ⠊⠕⠋⠋⠔⠤⠝⠁⠊⠇ ⠁⠎ ⠹⠑ ⠙⠑⠁⠙⠑⠌ ⠏⠊⠑⠊⠑ ⠕⠋ ⠊⠗⠕⠝⠍⠕⠝⠛⠻⠹
Chris@0 186 ⠔ ⠹⠑ ⠞⠗⠁⠙⠑⠲ ⡃⠥⠞ ⠹⠑ ⠺⠊⠎⠙⠕⠍ ⠕⠋ ⠳⠗ ⠁⠝⠊⠑⠌⠕⠗⠎
Chris@0 187 ⠊⠎ ⠔ ⠹⠑ ⠎⠊⠍⠊⠇⠑⠆ ⠁⠝⠙ ⠍⠹ ⠥⠝⠙⠁⠇⠇⠪⠫ ⠙⠁⠝⠙⠎
Chris@0 188 ⠩⠁⠇⠇ ⠝⠕⠞ ⠙⠊⠌⠥⠗⠃ ⠊⠞⠂ ⠕⠗ ⠹⠑ ⡊⠳⠝⠞⠗⠹⠰⠎ ⠙⠕⠝⠑ ⠋⠕⠗⠲ ⡹⠳
Chris@0 189 ⠺⠊⠇⠇ ⠹⠻⠑⠋⠕⠗⠑ ⠏⠻⠍⠊⠞ ⠍⠑ ⠞⠕ ⠗⠑⠏⠑⠁⠞⠂ ⠑⠍⠏⠙⠁⠞⠊⠊⠁⠇⠇⠹⠂ ⠹⠁⠞
Chris@0 190 ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
Chris@0 191
Chris@0 192 (The first couple of paragraphs of "A Christmas Carol" by Dickens)
Chris@0 193
Chris@0 194 Compact font selection example text:
Chris@0 195
Chris@0 196 ABCDEFGHIJKLMNOPQRSTUVWXYZ /0123456789
Chris@0 197 abcdefghijklmnopqrstuvwxyz £©µÀÆÖÞßéöÿ
Chris@0 198 –—‘“”„†•…‰™œŠŸž€ ΑΒΓΔΩαβγδω АБВГДабвгд
Chris@0 199 ∀∂∈ℝ∧∪≡∞ ↑↗↨↻⇣ ┐┼╔╘░►☺♀ fi�⑀₂ἠḂӥẄɐː⍎אԱა
Chris@0 200
Chris@0 201 Greetings in various languages:
Chris@0 202
Chris@0 203 Hello world, Καλημέρα κόσμε, コンニチハ
Chris@0 204
Chris@0 205 Box drawing alignment tests: █
Chris@0 206
Chris@0 207 ╔══╦══╗ ┌──┬──┐ ╭──┬──╮ ╭──┬──╮ ┏━━┳━━┓ ┎┒┏┑ ╷ ╻ ┏┯┓ ┌┰┐ ▊ ╱╲╱╲╳╳╳
Chris@0 208 ║┌─╨─┐║ │╔═╧═╗│ │╒═╪═╕│ │╓─╁─╖│ ┃┌─╂─┐┃ ┗╃╄┙ ╶┼╴╺╋╸┠┼┨ ┝╋┥ ▋ ╲╱╲╱╳╳╳
Chris@0 209 ║│╲ ╱│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╿ │┃ ┍╅╆┓ ╵ ╹ ┗┷┛ └┸┘ ▌ ╱╲╱╲╳╳╳
Chris@0 210 ╠╡ ╳ ╞╣ ├╢ ╟┤ ├┼─┼─┼┤ ├╫─╂─╫┤ ┣┿╾┼╼┿┫ ┕┛┖┚ ┌┄┄┐ ╎ ┏┅┅┓ ┋ ▍ ╲╱╲╱╳╳╳
Chris@0 211 ║│╱ ╲│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╽ │┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▎
Chris@0 212 ║└─╥─┘║ │╚═╤═╝│ │╘═╪═╛│ │╙─╀─╜│ ┃└─╂─┘┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▏
Chris@0 213 ╚══╩══╝ └──┴──┘ ╰──┴──╯ ╰──┴──╯ ┗━━┻━━┛ ▗▄▖▛▀▜ └╌╌┘ ╎ ┗╍╍┛ ┋ ▁▂▃▄▅▆▇█
Chris@0 214 ▝▀▘▙▄▟
Chris@0 215 """ .
Chris@0 216 <> rdfs:comment """
Chris@0 217 Two byte Unicode escape: \u00E0
Chris@0 218 Largest Unicode escape in Turtle: \U0010FFFF
Chris@0 219 """ .