掲示板 Forums - Ghost Kanji entry
Top > renshuu.org > Bugs / Problems Getting the posts
Top > renshuu.org > Bugs / Problems
𢦏, a phonetic component which is not a radical and is also not used as a character, is not present in the kanji dictionary. That’s as one would expect; however, if you search for 𢦏, you get this:
Note that the above is 𠮟, not 叱. 𠮟 was not previously in the kanji dictionary; now it is, but it seems to have been added in a half-broken way.
There are also dozens of irregular vocabulary items, none of which belong there. A few of them could arguably be regular readings, but most of them are simply garbage.
This is very interesting. I do not have time to look into it, but I have to wonder if the data encoding of the database column in mysql is causing these to come up as the same code point.
i dun think so, i replied after Michael b/c i wanna see update pings here
Let me ask again, just to make sure there is no misunderstanding. @マイコー, did you administratively remove a post for security reasons? Would you like me to send you the information again via DM?
Good, I didn’t think so, but I couldn’t rule it out.
In response to your comment that the problem might be in MySQL, I speculated that the issue most likely was tied to error handling. Specifically, I advised you to check the code that runs after a null query result for type conversion errors that might strip away bits.
Yea, not sure where that post went. Anyway, It's not..any of that, although I appreciate the suggestion!
So, while the collision is most likely due to the encoding, most of those kanji are not in renshuu's database to begin with, so when I fix it (not now), it'll simply return a "kanji not found", and not the actual kanji. They can be added, but due to the rarity of these kanji, it's simply not something I can work on at the moment.
I think I found what it is. If the unicode code point >= U+10000, then this bug happens. I don't know if there are other cases though.
Here are a few:
𠀋 𠂉 𠂢 𠂤 𠆢 𠈓 𠌫 𠍱 𠎁 𠏹 𠑊 𠔉 𠗖 𠘨 𠝏 𠠇 𠠺 𠢹 𠥼 𠦝 𠫓 𠬝 𠮟 𠵅 𠷡 𠹤 𠹭 𠺕 𠽟 𡈁 𡈽 𡉕 𡉴 𡉻 𡋗 𡋤 𡋽 𡌛 𡌶 𡍄 𡏄 𡑭 𡑮 𡗗 𡙇 𡚴 𡜆 𡝂 𡢽 𡧃 𡱖 𡴭 𡵅 𡵢 𡵸 𡶒 𡶜 𡶡 𡶷 𡷠 𡸳 𡸴 𡼞 𡽶 𡿺 𢅻 𢈘 𢌞 𢎭 𢛳 𢡛 𢢫 𢦏 𢪸 𢭆 𢭏 𢭐 𢮦 𢰝 𢰤 𢷡 𣆶 𣇃 𣇄 𣇵 𣍲 𣏐 𣏒 𣏓 𣏕 𣏚 𣏟 𣏤 𣑊 𣑋 𣑑 𣑥 𣓤 𣕚 𣖔 𣗄 𣘸 𣘹 𣘺 𣙇 𣜌 𣜜 𣜿 𣝣 𣝤 𣟧 𣟿 𣠤 𣠽 𣪘 𣱿 𣳾 𣴀 𣴎 𣵀 𣷓 𣷹 𣷺 𣽾 𤂖 𤄃 𤇆 𤇾 𤎼 𤘩 𤚥 𤟱 𤢖 𤩍 𤭖 𤭯 𤰖 𤴔 𤸎 𤸷 𤹪 𤺋 𥁊 𥁕 𥄢 𥆩 𥇍 𥇥 𥈞 𥉌 𥐮 𥒎 𥓙 𥔎 𥖧 𥝱 𥞩 𥞴 𥧄 𥧔 𥫣 𥫤 𥫱 𥮲 𥱋 𥱤 𥶡 𥸮 𥹖 𥹢 𥹥 𥻂 𥻘 𥻨 𥼣 𥽜 𥿔 𥿠 𥿻 𦀌 𦀗 𦁠 𦃭 𦉰 𦊆 𦍌 𦐂 𦙾 𦚰 𦜝 𦣝 𦣪 𦥑 𦥯 𦧝 𦨞 𦩘 𦪌 𦪷 𦫿 𦰩 𦱳 𦳝 𦹀 𦹥 𦾔 𦿶 𦿷 𦿸 𧃴 𧄍 𧄹 𧏚 𧏛 𧏾 𧐐 𧑉 𧘔 𧘕 𧘱 𧚄 𧚓 𧜎 𧜣 𧝒 𧦅 𧪄 𧮳 𧮾 𧯇 𧲸 𧶠 𧸐 𧾷 𨂊 𨂻 𨉷 𨊂 𨋳 𨏍 𨐌 𨑕 𨕫 𨗈 𨗉 𨛗 𨛺 𨥆 𨥉 𨥫 𨦇 𨦈 𨦺 𨦻 𨨞 𨨩 𨩃 𨩱 𨪙 𨫍 𨫝 𨫤 𨯁 𨯯 𨴐 𨵱 𨷻 𨸟 𨸶 𨺉 𨻫 𨼲 𨿸 𩊠 𩊱 𩒐 𩗏 𩙿 𩛰 𩜙 𩝐 𩣆 𩩲 𩷛 𩸕 𩸽 𩹉 𩺊 𩻄 𩻛 𩻩 𩿎 𪀚 𪀯 𪂂 𪃹 𪆐 𪎌 𪐷 𪗱 𪘂 𪘚 𪚲
Nice find. I think this is turning into a larger issue than first suspected. This thread is probably related: forums/topics/16801/Im_confused
Doubtful that they're linked.
I'm pretty sure that none of those were part of the original kanji imports that I made into renshuu's database. While there is also the code point issue that is tied into the collation used by the database, even if that wasn't happening, most if not all of these simply are not in renshuu's database to begin with, and I do not expect to have the time to add them in the near future.