annotate src/serd-0.18.2/tests/UTF-8.ttl @ 164:9fa11135915a

This Mac compatibility fix appears to work, resolving SourceForge issue #273 Port Audio Fail on MacOS Catalina (10.15) - merging back
author Chris Cannam <cannam@all-day-breakfast.com>
date Thu, 31 Oct 2019 13:20:41 +0000
parents 545efbb81310
children
rev   line source
cannam@85 1 @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
cannam@85 2
cannam@85 3 <> rdfs:comment """
cannam@85 4 UTF-8 encoded sample plain-text file
cannam@85 5 ‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾
cannam@85 6
cannam@85 7 Markus Kuhn [ˈmaʳkʊs kuːn] <http://www.cl.cam.ac.uk/~mgk25/> — 2002-07-25
cannam@85 8
cannam@85 9
cannam@85 10 The ASCII compatible UTF-8 encoding used in this plain-text file
cannam@85 11 is defined in Unicode, ISO 10646-1, and RFC 2279.
cannam@85 12
cannam@85 13
cannam@85 14 Using Unicode/UTF-8, you can write in emails and source code things such as
cannam@85 15
cannam@85 16 Mathematics and sciences:
cannam@85 17
cannam@85 18 ∮ E⋅da = Q, n → ∞, ∑ f(i) = ∏ g(i), ⎧⎡⎛┌─────┐⎞⎤⎫
cannam@85 19 ⎪⎢⎜│a²+b³ ⎟⎥⎪
cannam@85 20 ∀x∈ℝ: ⌈x⌉ = −⌊−x⌋, α ∧ ¬β = ¬(¬α ∨ β), ⎪⎢⎜│───── ⎟⎥⎪
cannam@85 21 ⎪⎢⎜⎷ c₈ ⎟⎥⎪
cannam@85 22 ℕ ⊆ ℕ₀ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ, ⎨⎢⎜ ⎟⎥⎬
cannam@85 23 ⎪⎢⎜ ∞ ⎟⎥⎪
cannam@85 24 ⊥ < a ≠ b ≡ c ≤ d ≪ ⊤ ⇒ (⟦A⟧ ⇔ ⟪B⟫), ⎪⎢⎜ ⎲ ⎟⎥⎪
cannam@85 25 ⎪⎢⎜ ⎳aⁱ-bⁱ⎟⎥⎪
cannam@85 26 2H₂ + O₂ ⇌ 2H₂O, R = 4.7 kΩ, ⌀ 200 mm ⎩⎣⎝i=1 ⎠⎦⎭
cannam@85 27
cannam@85 28 Linguistics and dictionaries:
cannam@85 29
cannam@85 30 ði ıntəˈnæʃənəl fəˈnɛtık əsoʊsiˈeıʃn
cannam@85 31 Y [ˈʏpsilɔn], Yen [jɛn], Yoga [ˈjoːgɑ]
cannam@85 32
cannam@85 33 APL:
cannam@85 34
cannam@85 35 ((V⍳V)=⍳⍴V)/V←,V ⌷←⍳→⍴∆∇⊃‾⍎⍕⌈
cannam@85 36
cannam@85 37 Nicer typography in plain text files:
cannam@85 38
cannam@85 39 ╔══════════════════════════════════════════╗
cannam@85 40 ║ ║
cannam@85 41 ║ • ‘single’ and “double” quotes ║
cannam@85 42 ║ ║
cannam@85 43 ║ • Curly apostrophes: “We’ve been here” ║
cannam@85 44 ║ ║
cannam@85 45 ║ • Latin-1 apostrophe and accents: '´` ║
cannam@85 46 ║ ║
cannam@85 47 ║ • ‚deutsche‘ „Anführungszeichen“ ║
cannam@85 48 ║ ║
cannam@85 49 ║ • †, ‡, ‰, •, 3–4, —, −5/+5, ™, … ║
cannam@85 50 ║ ║
cannam@85 51 ║ • ASCII safety test: 1lI|, 0OD, 8B ║
cannam@85 52 ║ ╭─────────╮ ║
cannam@85 53 ║ • the euro symbol: │ 14.95 € │ ║
cannam@85 54 ║ ╰─────────╯ ║
cannam@85 55 ╚══════════════════════════════════════════╝
cannam@85 56
cannam@85 57 Combining characters:
cannam@85 58
cannam@85 59 STARGΛ̊TE SG-1, a = v̇ = r̈, a⃑ ⊥ b⃑
cannam@85 60
cannam@85 61 Greek (in Polytonic):
cannam@85 62
cannam@85 63 The Greek anthem:
cannam@85 64
cannam@85 65 Σὲ γνωρίζω ἀπὸ τὴν κόψη
cannam@85 66 τοῦ σπαθιοῦ τὴν τρομερή,
cannam@85 67 σὲ γνωρίζω ἀπὸ τὴν ὄψη
cannam@85 68 ποὺ μὲ βία μετράει τὴ γῆ.
cannam@85 69
cannam@85 70 ᾿Απ᾿ τὰ κόκκαλα βγαλμένη
cannam@85 71 τῶν ῾Ελλήνων τὰ ἱερά
cannam@85 72 καὶ σὰν πρῶτα ἀνδρειωμένη
cannam@85 73 χαῖρε, ὦ χαῖρε, ᾿Ελευθεριά!
cannam@85 74
cannam@85 75 From a speech of Demosthenes in the 4th century BC:
cannam@85 76
cannam@85 77 Οὐχὶ ταὐτὰ παρίσταταί μοι γιγνώσκειν, ὦ ἄνδρες ᾿Αθηναῖοι,
cannam@85 78 ὅταν τ᾿ εἰς τὰ πράγματα ἀποβλέψω καὶ ὅταν πρὸς τοὺς
cannam@85 79 λόγους οὓς ἀκούω· τοὺς μὲν γὰρ λόγους περὶ τοῦ
cannam@85 80 τιμωρήσασθαι Φίλιππον ὁρῶ γιγνομένους, τὰ δὲ πράγματ᾿
cannam@85 81 εἰς τοῦτο προήκοντα, ὥσθ᾿ ὅπως μὴ πεισόμεθ᾿ αὐτοὶ
cannam@85 82 πρότερον κακῶς σκέψασθαι δέον. οὐδέν οὖν ἄλλο μοι δοκοῦσιν
cannam@85 83 οἱ τὰ τοιαῦτα λέγοντες ἢ τὴν ὑπόθεσιν, περὶ ἧς βουλεύεσθαι,
cannam@85 84 οὐχὶ τὴν οὖσαν παριστάντες ὑμῖν ἁμαρτάνειν. ἐγὼ δέ, ὅτι μέν
cannam@85 85 ποτ᾿ ἐξῆν τῇ πόλει καὶ τὰ αὑτῆς ἔχειν ἀσφαλῶς καὶ Φίλιππον
cannam@85 86 τιμωρήσασθαι, καὶ μάλ᾿ ἀκριβῶς οἶδα· ἐπ᾿ ἐμοῦ γάρ, οὐ πάλαι
cannam@85 87 γέγονεν ταῦτ᾿ ἀμφότερα· νῦν μέντοι πέπεισμαι τοῦθ᾿ ἱκανὸν
cannam@85 88 προλαβεῖν ἡμῖν εἶναι τὴν πρώτην, ὅπως τοὺς συμμάχους
cannam@85 89 σώσομεν. ἐὰν γὰρ τοῦτο βεβαίως ὑπάρξῃ, τότε καὶ περὶ τοῦ
cannam@85 90 τίνα τιμωρήσεταί τις καὶ ὃν τρόπον ἐξέσται σκοπεῖν· πρὶν δὲ
cannam@85 91 τὴν ἀρχὴν ὀρθῶς ὑποθέσθαι, μάταιον ἡγοῦμαι περὶ τῆς
cannam@85 92 τελευτῆς ὁντινοῦν ποιεῖσθαι λόγον.
cannam@85 93
cannam@85 94 Δημοσθένους, Γ´ ᾿Ολυνθιακὸς
cannam@85 95
cannam@85 96 Georgian:
cannam@85 97
cannam@85 98 From a Unicode conference invitation:
cannam@85 99
cannam@85 100 გთხოვთ ახლავე გაიაროთ რეგისტრაცია Unicode-ის მეათე საერთაშორისო
cannam@85 101 კონფერენციაზე დასასწრებად, რომელიც გაიმართება 10-12 მარტს,
cannam@85 102 ქ. მაინცში, გერმანიაში. კონფერენცია შეჰკრებს ერთად მსოფლიოს
cannam@85 103 ექსპერტებს ისეთ დარგებში როგორიცაა ინტერნეტი და Unicode-ი,
cannam@85 104 ინტერნაციონალიზაცია და ლოკალიზაცია, Unicode-ის გამოყენება
cannam@85 105 ოპერაციულ სისტემებსა, და გამოყენებით პროგრამებში, შრიფტებში,
cannam@85 106 ტექსტების დამუშავებასა და მრავალენოვან კომპიუტერულ სისტემებში.
cannam@85 107
cannam@85 108 Russian:
cannam@85 109
cannam@85 110 From a Unicode conference invitation:
cannam@85 111
cannam@85 112 Зарегистрируйтесь сейчас на Десятую Международную Конференцию по
cannam@85 113 Unicode, которая состоится 10-12 марта 1997 года в Майнце в Германии.
cannam@85 114 Конференция соберет широкий круг экспертов по вопросам глобального
cannam@85 115 Интернета и Unicode, локализации и интернационализации, воплощению и
cannam@85 116 применению Unicode в различных операционных системах и программных
cannam@85 117 приложениях, шрифтах, верстке и многоязычных компьютерных системах.
cannam@85 118
cannam@85 119 Thai (UCS Level 2):
cannam@85 120
cannam@85 121 Excerpt from a poetry on The Romance of The Three Kingdoms (a Chinese
cannam@85 122 classic 'San Gua'):
cannam@85 123
cannam@85 124 [----------------------------|------------------------]
cannam@85 125 ๏ แผ่นดินฮั่นเสื่อมโทรมแสนสังเวช พระปกเกศกองบู๊กู้ขึ้นใหม่
cannam@85 126 สิบสองกษัตริย์ก่อนหน้าแลถัดไป สององค์ไซร้โง่เขลาเบาปัญญา
cannam@85 127 ทรงนับถือขันทีเป็นที่พึ่ง บ้านเมืองจึงวิปริตเป็นนักหนา
cannam@85 128 โฮจิ๋นเรียกทัพทั่วหัวเมืองมา หมายจะฆ่ามดชั่วตัวสำคัญ
cannam@85 129 เหมือนขับไสไล่เสือจากเคหา รับหมาป่าเข้ามาเลยอาสัญ
cannam@85 130 ฝ่ายอ้องอุ้นยุแยกให้แตกกัน ใช้สาวนั้นเป็นชนวนชื่นชวนใจ
cannam@85 131 พลันลิฉุยกุยกีกลับก่อเหตุ ช่างอาเพศจริงหนาฟ้าร้องไห้
cannam@85 132 ต้องรบราฆ่าฟันจนบรรลัย ฤๅหาใครค้ำชูกู้บรรลังก์ ฯ
cannam@85 133
cannam@85 134 (The above is a two-column text. If combining characters are handled
cannam@85 135 correctly, the lines of the second column should be aligned with the
cannam@85 136 | character above.)
cannam@85 137
cannam@85 138 Ethiopian:
cannam@85 139
cannam@85 140 Proverbs in the Amharic language:
cannam@85 141
cannam@85 142 ሰማይ አይታረስ ንጉሥ አይከሰስ።
cannam@85 143 ብላ ካለኝ እንደአባቴ በቆመጠኝ።
cannam@85 144 ጌጥ ያለቤቱ ቁምጥና ነው።
cannam@85 145 ደሀ በሕልሙ ቅቤ ባይጠጣ ንጣት በገደለው።
cannam@85 146 የአፍ ወለምታ በቅቤ አይታሽም።
cannam@85 147 አይጥ በበላ ዳዋ ተመታ።
cannam@85 148 ሲተረጉሙ ይደረግሙ።
cannam@85 149 ቀስ በቀስ፥ ዕንቁላል በእግሩ ይሄዳል።
cannam@85 150 ድር ቢያብር አንበሳ ያስር።
cannam@85 151 ሰው እንደቤቱ እንጅ እንደ ጉረቤቱ አይተዳደርም።
cannam@85 152 እግዜር የከፈተውን ጉሮሮ ሳይዘጋው አይድርም።
cannam@85 153 የጎረቤት ሌባ፥ ቢያዩት ይስቅ ባያዩት ያጠልቅ።
cannam@85 154 ሥራ ከመፍታት ልጄን ላፋታት።
cannam@85 155 ዓባይ ማደሪያ የለው፥ ግንድ ይዞ ይዞራል።
cannam@85 156 የእስላም አገሩ መካ የአሞራ አገሩ ዋርካ።
cannam@85 157 ተንጋሎ ቢተፉ ተመልሶ ባፉ።
cannam@85 158 ወዳጅህ ማር ቢሆን ጨርስህ አትላሰው።
cannam@85 159 እግርህን በፍራሽህ ልክ ዘርጋ።
cannam@85 160
cannam@85 161 Runes:
cannam@85 162
cannam@85 163 ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ ᚾᚩᚱᚦᚹᛖᚪᚱᛞᚢᛗ ᚹᛁᚦ ᚦᚪ ᚹᛖᛥᚫ
cannam@85 164
cannam@85 165 (Old English, which transcribed into Latin reads 'He cwaeth that he
cannam@85 166 bude thaem lande northweardum with tha Westsae.' and means 'He said
cannam@85 167 that he lived in the northern land near the Western Sea.')
cannam@85 168
cannam@85 169 Braille:
cannam@85 170
cannam@85 171 ⡌⠁⠧⠑ ⠼⠁⠒ ⡍⠜⠇⠑⠹⠰⠎ ⡣⠕⠌
cannam@85 172
cannam@85 173 ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠙⠑⠁⠙⠒ ⠞⠕ ⠃⠑⠛⠔ ⠺⠊⠹⠲ ⡹⠻⠑ ⠊⠎ ⠝⠕ ⠙⠳⠃⠞
cannam@85 174 ⠱⠁⠞⠑⠧⠻ ⠁⠃⠳⠞ ⠹⠁⠞⠲ ⡹⠑ ⠗⠑⠛⠊⠌⠻ ⠕⠋ ⠙⠊⠎ ⠃⠥⠗⠊⠁⠇ ⠺⠁⠎
cannam@85 175 ⠎⠊⠛⠝⠫ ⠃⠹ ⠹⠑ ⠊⠇⠻⠛⠹⠍⠁⠝⠂ ⠹⠑ ⠊⠇⠻⠅⠂ ⠹⠑ ⠥⠝⠙⠻⠞⠁⠅⠻⠂
cannam@85 176 ⠁⠝⠙ ⠹⠑ ⠡⠊⠑⠋ ⠍⠳⠗⠝⠻⠲ ⡎⠊⠗⠕⠕⠛⠑ ⠎⠊⠛⠝⠫ ⠊⠞⠲ ⡁⠝⠙
cannam@85 177 ⡎⠊⠗⠕⠕⠛⠑⠰⠎ ⠝⠁⠍⠑ ⠺⠁⠎ ⠛⠕⠕⠙ ⠥⠏⠕⠝ ⠰⡡⠁⠝⠛⠑⠂ ⠋⠕⠗ ⠁⠝⠹⠹⠔⠛ ⠙⠑
cannam@85 178 ⠡⠕⠎⠑ ⠞⠕ ⠏⠥⠞ ⠙⠊⠎ ⠙⠁⠝⠙ ⠞⠕⠲
cannam@85 179
cannam@85 180 ⡕⠇⠙ ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
cannam@85 181
cannam@85 182 ⡍⠔⠙⠖ ⡊ ⠙⠕⠝⠰⠞ ⠍⠑⠁⠝ ⠞⠕ ⠎⠁⠹ ⠹⠁⠞ ⡊ ⠅⠝⠪⠂ ⠕⠋ ⠍⠹
cannam@85 183 ⠪⠝ ⠅⠝⠪⠇⠫⠛⠑⠂ ⠱⠁⠞ ⠹⠻⠑ ⠊⠎ ⠏⠜⠞⠊⠊⠥⠇⠜⠇⠹ ⠙⠑⠁⠙ ⠁⠃⠳⠞
cannam@85 184 ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲ ⡊ ⠍⠊⠣⠞ ⠙⠁⠧⠑ ⠃⠑⠲ ⠔⠊⠇⠔⠫⠂ ⠍⠹⠎⠑⠇⠋⠂ ⠞⠕
cannam@85 185 ⠗⠑⠛⠜⠙ ⠁ ⠊⠕⠋⠋⠔⠤⠝⠁⠊⠇ ⠁⠎ ⠹⠑ ⠙⠑⠁⠙⠑⠌ ⠏⠊⠑⠊⠑ ⠕⠋ ⠊⠗⠕⠝⠍⠕⠝⠛⠻⠹
cannam@85 186 ⠔ ⠹⠑ ⠞⠗⠁⠙⠑⠲ ⡃⠥⠞ ⠹⠑ ⠺⠊⠎⠙⠕⠍ ⠕⠋ ⠳⠗ ⠁⠝⠊⠑⠌⠕⠗⠎
cannam@85 187 ⠊⠎ ⠔ ⠹⠑ ⠎⠊⠍⠊⠇⠑⠆ ⠁⠝⠙ ⠍⠹ ⠥⠝⠙⠁⠇⠇⠪⠫ ⠙⠁⠝⠙⠎
cannam@85 188 ⠩⠁⠇⠇ ⠝⠕⠞ ⠙⠊⠌⠥⠗⠃ ⠊⠞⠂ ⠕⠗ ⠹⠑ ⡊⠳⠝⠞⠗⠹⠰⠎ ⠙⠕⠝⠑ ⠋⠕⠗⠲ ⡹⠳
cannam@85 189 ⠺⠊⠇⠇ ⠹⠻⠑⠋⠕⠗⠑ ⠏⠻⠍⠊⠞ ⠍⠑ ⠞⠕ ⠗⠑⠏⠑⠁⠞⠂ ⠑⠍⠏⠙⠁⠞⠊⠊⠁⠇⠇⠹⠂ ⠹⠁⠞
cannam@85 190 ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
cannam@85 191
cannam@85 192 (The first couple of paragraphs of "A Christmas Carol" by Dickens)
cannam@85 193
cannam@85 194 Compact font selection example text:
cannam@85 195
cannam@85 196 ABCDEFGHIJKLMNOPQRSTUVWXYZ /0123456789
cannam@85 197 abcdefghijklmnopqrstuvwxyz £©µÀÆÖÞßéöÿ
cannam@85 198 –—‘“”„†•…‰™œŠŸž€ ΑΒΓΔΩαβγδω АБВГДабвгд
cannam@85 199 ∀∂∈ℝ∧∪≡∞ ↑↗↨↻⇣ ┐┼╔╘░►☺♀ fi�⑀₂ἠḂӥẄɐː⍎אԱა
cannam@85 200
cannam@85 201 Greetings in various languages:
cannam@85 202
cannam@85 203 Hello world, Καλημέρα κόσμε, コンニチハ
cannam@85 204
cannam@85 205 Box drawing alignment tests: █
cannam@85 206
cannam@85 207 ╔══╦══╗ ┌──┬──┐ ╭──┬──╮ ╭──┬──╮ ┏━━┳━━┓ ┎┒┏┑ ╷ ╻ ┏┯┓ ┌┰┐ ▊ ╱╲╱╲╳╳╳
cannam@85 208 ║┌─╨─┐║ │╔═╧═╗│ │╒═╪═╕│ │╓─╁─╖│ ┃┌─╂─┐┃ ┗╃╄┙ ╶┼╴╺╋╸┠┼┨ ┝╋┥ ▋ ╲╱╲╱╳╳╳
cannam@85 209 ║│╲ ╱│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╿ │┃ ┍╅╆┓ ╵ ╹ ┗┷┛ └┸┘ ▌ ╱╲╱╲╳╳╳
cannam@85 210 ╠╡ ╳ ╞╣ ├╢ ╟┤ ├┼─┼─┼┤ ├╫─╂─╫┤ ┣┿╾┼╼┿┫ ┕┛┖┚ ┌┄┄┐ ╎ ┏┅┅┓ ┋ ▍ ╲╱╲╱╳╳╳
cannam@85 211 ║│╱ ╲│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╽ │┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▎
cannam@85 212 ║└─╥─┘║ │╚═╤═╝│ │╘═╪═╛│ │╙─╀─╜│ ┃└─╂─┘┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▏
cannam@85 213 ╚══╩══╝ └──┴──┘ ╰──┴──╯ ╰──┴──╯ ┗━━┻━━┛ ▗▄▖▛▀▜ └╌╌┘ ╎ ┗╍╍┛ ┋ ▁▂▃▄▅▆▇█
cannam@85 214 ▝▀▘▙▄▟
cannam@85 215 """ .
cannam@85 216 <> rdfs:comment """
cannam@85 217 Two byte Unicode escape: \u00E0
cannam@85 218 Largest Unicode escape in Turtle: \U0010FFFF
cannam@85 219 """ .