90 Pages V « < 88 89 90  
Closed TopicStart new topic
> New Search Engine, No Read, Only Post

 
post Sep 5 2023, 15:06
Post #1781
Tenboro

Admin




QUOTE(peterson123 @ Sep 5 2023, 12:29) *

So did you conciously change the behaviour of the flag? E.g. using it with the /tag/ endpoint it seems to work as expected:
http://ehentaihip.com/tag/cnaruto
http://ehentaihip.com/tag/cnaruto&skip_mastertags=1

But it doesn't work with the f_search option in the URL anymore:
http://ehentaihip.com/?f_search=cnaruto
http://ehentaihip.com/?f_search=cnaruto&skip_mastertags=1 <- still gives the results of the master.

I'm very sure that before, the latter option would also work. Without that, I don't see how the flag has any merit whatsoever (to a normal user).


It doesn't have any merit to a normal user and is not supposed to, it's an internal flag for use with internal tools and has never been advertised as anything else.

I did recently correct an issue where it would still attempt to apply it to normal searches if you provided it manually. Not just for the hell of it, but because it would corrupt the search caches so that if you then did the search normally, it would use the previously cached result and still not include the master tags.
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Sep 5 2023, 15:40
Post #1782
peterson123



Veteran Poster
********
Group: Members
Posts: 2,744
Joined: 22-February 12
Level 500 (Godslayer)


QUOTE(Tenboro @ Sep 5 2023, 12:06) *
It doesn't have any merit to a normal user and is not supposed to, it's an internal flag for use with internal tools and has never been advertised as anything else.

Maybe you didn't advertise it as such but the wiki certainly recommended the option to normal users (and still does in other places, though I trust Shank will take care of them soon).

To present an actual use case I had today, I wanted to find all unlsaved temp tags starting with "viper", to find un-namespaced tags in the obscure viper series of video games. I know the search tool under My Tags does something similar but it has limited yield and includes slaved temp tags, so it's not as efficient. But if there is a technical reason to throw it out, that's okay. Still sad to see the functionality go, it was a good friend of many years. cry.gif
User is online!Profile CardPM
Go to the top of the page
+Quote Post

 
post Sep 5 2023, 15:50
Post #1783
Shank



Roll for Initiative
**********
Group: Global Mods
Posts: 8,536
Joined: 19-May 12
Level 500 (Ponyslayer)


QUOTE(peterson123 @ Sep 5 2023, 12:40) *
Already removed

QUOTE(peterson123 @ Sep 5 2023, 12:40) *
(and still does in other places, though I trust Shank will take care of them soon).
Now taken care of


--------------------
Tagging Cleanup Contest #13
HV Quest & Activities Board
QUOTE(Kagoraphobia @ Apr 1 2024, 10:06) *
This is an authoritarian dictatorship
QUOTE(Kagoraphobia @ Aug 13 2024, 00:02) *
Democracy has spoken.
User is online!Profile CardPM
Go to the top of the page
+Quote Post

 
post Sep 29 2023, 20:31
Post #1784
peterson123



Veteran Poster
********
Group: Members
Posts: 2,744
Joined: 22-February 12
Level 500 (Godslayer)


Skimming through some old posts has reminded me that there was some demand for being able to do something like "find all galleries with no tags in the artist namespace". I still think that could be useful. Before this big search engine update, it was said to be not viable due to technical limitations. Is it any more viable now?
User is online!Profile CardPM
Go to the top of the page
+Quote Post

 
post Sep 30 2023, 09:58
Post #1785
Tenboro

Admin




QUOTE(peterson123 @ Sep 29 2023, 19:31) *
Skimming through some old posts has reminded me that there was some demand for being able to do something like "find all galleries with no tags in the artist namespace". I still think that could be useful. Before this big search engine update, it was said to be not viable due to technical limitations. Is it any more viable now?


It's probably possible to support it, but there are no plans to add it in the near future.
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 7 2023, 17:41
Post #1786
ttp...



Newcomer
*
Group: Members
Posts: 43
Joined: 12-August 17
Level 342 (Godslayer)


can support add uploader to watched?
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 12 2023, 01:56
Post #1787
Blitzkralle



Lurker
Group: Lurkers
Posts: 2
Joined: 29-May 11
Level 13 (Novice)


Hey, I'm trying to use the OR operator like this:
parody:naruto$ ~parody:original$
What I'm aiming for is to filter doujishis with a parody of Naruto, otherwise just show me the original ones. I think I might be using it incorrectly huh.gif . In the end, I'm trying to find a way to search for both manga and doujishis, but if it's a doujishi, only show me the original ones. tnks for any kind of help smile.gif
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 12 2023, 09:02
Post #1788
peterson123



Veteran Poster
********
Group: Members
Posts: 2,744
Joined: 22-February 12
Level 500 (Godslayer)


QUOTE(Blitzkralle @ Oct 11 2023, 22:56) *
parody:naruto$ ~parody:original$
What I'm aiming for is to filter doujishis with a parody of Naruto, otherwise just show me the original ones.

Try: ~parody:naruto$ ~parody:original$
If you are doing an OR search, you need at least two search items with "~" for it to make sense.


QUOTE(Blitzkralle @ Oct 11 2023, 22:56) *
In the end, I'm trying to find a way to search for both manga and doujishis, but if it's a doujishi, only show me the original ones. tnks for any kind of help smile.gif

I don't think this is possible, you will have to do two separate searches (one for all manga, one for only original doujinshi).
User is online!Profile CardPM
Go to the top of the page
+Quote Post

 
post Oct 13 2023, 00:30
Post #1789
Blitzkralle



Lurker
Group: Lurkers
Posts: 2
Joined: 29-May 11
Level 13 (Novice)


QUOTE(peterson123 @ Oct 12 2023, 08:02) *

Try: ~parody:naruto$ ~parody:original$
If you are doing an OR search, you need at least two search items with "~" for it to make sense.
I don't think this is possible, you will have to do two separate searches (one for all manga, one for only original doujinshi).


Tnks! that one works on the naruto example, a shame about the second one but it is no a big problem!
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 27 2023, 21:32
Post #1790
Necromusume




*********
Group: Catgirl Camarilla
Posts: 6,249
Joined: 17-May 12
Level 500 (Ponyslayer)


An issue with the search engine came up in the Renaming/Reclassing thread. Someone uploaded a set of galleries named like,
π™ˆπ™€π˜Ώπ™π™Žπ˜Ό|π™π™„π˜Ώπ™€π™ (π™π˜Όπ™π™€ π™Žπ™€π™π™„π™€π™Ž) [𝙋𝙄𝙓𝙄𝙑] 7 {π˜Όπ™ž π™‚π™šπ™£π™šπ™§π™–π™©π™šπ™™}

using the [codepoints.net] Mathematical Alphanumeric Symbols Unicode block, Mathematical Bold Italic characters, for extra style.

The search engine does not normalize those to regular english text, so they aren't searchable.

http://ehentaihip.com/?f_search=π™ˆπ™€π˜Ώπ™π™Žπ˜Ό
"No hits found." (They all got renamed; it would normally be bottom priority because it shouldn't affect the search engine.)

http://ehentaihip.com/?f_search=MEDUSA
"Found 369 results."

Startpage, DuckDuckGo and Wikipedia will all find Medusa if searched with π™ˆπ™€π˜Ώπ™π™Žπ˜Ό.


--------------------
HΜΆΜ’Μža̸̬͌iΜΆΜ„Ν‡lΜ΄Μ‘Μ‘ ̸̳́P̸̊̒hΜ΄Ν’ΜͺyΜ΅Μ”ΜΌrΜΆΝ‹Νœȅ̡̫xΜ΅Μ½Μ¨iΜ΄Ν˜Μ—a̡̟͊
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 27 2023, 22:02
Post #1791
Tenboro

Admin




There are a lot of characters in Unicode that are similar to ASCII alphanumericals, and even if we were to try to find all of them, people are always going to find new ones. We do remap the ones that are ο½ƒο½ο½ο½ο½ο½Žο½Œο½™γ€€ο½•ο½“ο½…ο½„γ€€ο½‰ο½Žγ€€οΌͺο½ο½ο½ο½Žο½…ο½“ο½… for searching/indexing purposes, but I don't really feel like spending the time trying to create a map of all the other variants.

Incidentally, you say it works on Wikipedia, but Mediawiki doesn't seem to be doing it, so I can't just grab a map from there. Could be Wikipedia runs on a newer version though.
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Oct 27 2023, 22:11
Post #1792
Necromusume




*********
Group: Catgirl Camarilla
Posts: 6,249
Joined: 17-May 12
Level 500 (Ponyslayer)


I think there's probably an open source library that will handle normalizing unicode text like that.

[www.unicode.org] Unicode Standard Annex #15 - Unicode Normalization Forms

Maybe this.
[github.com] Unicode Normalization forms according to UAX#15 rules

[unicode.org] Unicode FAQ - Normalization
NFKD normalization form looks appropriate for search.

Example of NFKD normalization using an online Unicode transformer.
[util.unicode.org] https://util.unicode.org/UnicodeJsps/transf...%F0%9D%99%99%7D

EHwiki search box doesn't do it,
https://ehwiki.org/index.php?search=%F0%9D%...&fulltext=1

But Wikipedia does.
[en.wikipedia.org] https://en.wikipedia.org/w/index.php?fullte...earch&ns0=1

MediaWiki's documentation page:
[www.mediawiki.org] Unicode normalization considerations
QUOTE
Since 1.4, MediaWiki applies normalization form C (NFC) to Unicode text input.

[www.mediawiki.org] utfnormal
QUOTE
utfnormal is a library that contains Unicode normalization routines. It includes pure PHP implementations, and automatically uses the php-intl extension if installed.


This post has been edited by Necromusume: Oct 27 2023, 23:23


--------------------
HΜΆΜ’Μža̸̬͌iΜΆΜ„Ν‡lΜ΄Μ‘Μ‘ ̸̳́P̸̊̒hΜ΄Ν’ΜͺyΜ΅Μ”ΜΌrΜΆΝ‹Νœȅ̡̫xΜ΅Μ½Μ¨iΜ΄Ν˜Μ—a̡̟͊
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Nov 22 2023, 12:33
Post #1793
peterson123



Veteran Poster
********
Group: Members
Posts: 2,744
Joined: 22-February 12
Level 500 (Godslayer)


Small oddity:

Searching o:comic$ in western gives "50,000+ results".
Seacrhing other:comic$ in western gives "about 70,000 results".

Both searches seem to yield exactly the same (as expected) but for some reason the result count estimation is more specific for the second one.
User is online!Profile CardPM
Go to the top of the page
+Quote Post

 
post Nov 22 2023, 13:06
Post #1794
Tenboro

Admin




QUOTE(peterson123 @ Nov 22 2023, 10:33) *
Both searches seem to yield exactly the same (as expected) but for some reason the result count estimation is more specific for the second one.


Count estimates are refined as the result set is explored (which is basically free), so most likely the second one just has had more pages sampled.
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

 
post Jan 7 2024, 10:08
Post #1795
shadowfr



Lurker
Group: Lurkers
Posts: 2
Joined: 3-July 11


I think the new engine is really subtle...
User is offlineProfile CardPM
Go to the top of the page
+Quote Post


90 Pages V « < 88 89 90
Closed TopicStart new topic
1 User(s) are reading this topic (0 Guests and 0 Anonymous Users)
1 Members: ww183404

 


Lo-Fi Version Time is now: 21st October 2024 - 18:39