Questions tagged [accent-insensitive]

Collation which considers the accented and unaccented versions of letters to be identical for sorting purposes.

54 questions
48
votes
2 answers

Programatic Accent Reduction in JavaScript (aka text normalization or unaccenting)

I need to compare 2 strings as equal such as these: Lubeck == Lübeck In JavaScript. Why? Well, I have an auto-completion field that's going out to a Java service using Lucene, where place names are stored naturally (as Lübeck), but also indexed as…
20
votes
2 answers

Compare strings ignoring accented characters

I would like to know if there is a method that compares 2 strings and ignores the accents making "noção" equal to "nocao". it would be something like string1.methodCompareIgnoreAccent(string2);
alexandre1985
  • 966
  • 3
  • 11
  • 26
14
votes
8 answers

jQuery DataTables - Accent-Insensitive Alphabetization and Searching

When using jQuery DataTables is it possible to do accent-insensitive searches when using the filter? For instance, when I put the 'e' character, I'd like to search every word with 'e' or 'é', 'è'. Something that came to mind is normalizing the…
12
votes
7 answers

MySQL REGEXP query - accent insensitive search

I'm looking to query a database of wine names, many of which contain accents (but not in a uniform way, and so similar wines may be entered with or without accents) The basic query looks like this: SELECT * FROM `table` WHERE `wine_name` REGEXP…
freestate
  • 907
  • 2
  • 9
  • 14
11
votes
1 answer

MongoDB diacriticInSensitive search not showing all accented (words with diacritic mark) rows as expected and vice-versa

I have a document collection with following structure uid, name With a Index db.Collection.createIndex({name: "text"}) It contains following data 1, iphone 2, iphóne 3, iphonë 4, iphónë When I am doing text search for iphone I am getting only…
9
votes
1 answer

Using unique() and == to match accented vs. non-accented characters

I'm putting together some tables that look almost the same, except that some characters appear accented in some and non-accented in others. For instance, "André" sometimes reads "Andre", "Flávio" and "Flavio", etc. I need to consider all variations…
Rodrigo
  • 3,829
  • 6
  • 41
  • 72
9
votes
2 answers

How to compare strings with case insensitive and accent insensitive

How to compare strings with case insensitive and accent insensitive Alright this is done easily at SQL server However I would like to do the same at C# .NET 4.5.1. How can I do that with most proper way? I mean these 3 strings should return equal…
MonsterMMORPG
  • 20,310
  • 69
  • 183
  • 306
9
votes
1 answer

SQLite accent-insensitive search

Is there any way to do an accent-insensitive LIKE query in SQLite? For example, this query: SELECT * FROM users WHERE name LIKE "Andre%" would return: André the Giant Andre Agassi etc. I'm using Qt with QSqlDatabase if it makes any difference.
laurent
  • 79,308
  • 64
  • 256
  • 389
7
votes
1 answer

Regex - match a character and all its diacritic variations (aka accent-insensitive)

I am trying to match a character and all its possible diacritic variations (aka accent-insensitive) with a regular expression. What I could do of course is: re.match(r"^[eēéěèȅêęëėẹẽĕȇȩę̋ḕḗḙḛḝė̄]$", "é") but that is not a general solution. If I use…
Felk
  • 6,165
  • 1
  • 27
  • 51
6
votes
1 answer

unaccent() does not work with Greek letters in plpgsql dynamic query

I use PostgreSQL 10 and I run CREATE EXTENSION unaccent; succesfully. I have a plgsql function that contains the following whereText := 'lower(unaccent(place.name)) LIKE lower(unaccent($1))'; later, according to what user chose, more clauses may be…
slevin
  • 3,640
  • 16
  • 58
  • 106
6
votes
2 answers

Regex for accent insensitive replacement in python

In Python 3, I'd like to be able to use re.sub() in an "accent-insensitive" way, as we can do with the re.I flag for case-insensitive substitution. Could be something like a re.IGNOREACCENTS flag: original_text = "¿It's 80°C, I'm drinking a café in…
6
votes
1 answer

bootstrap-select: can one use accent insensitive searches?

I am using bootstrap-select for a form using data-live-search enabled, so that the user can both type the search term and find it in the dropdown. The dropdown list, however, has some terms with accents, like for example: León Castellón…
PaulJ
  • 1,410
  • 4
  • 25
  • 43
6
votes
2 answers

Unicode comparison of Cyrillic 'С' and Latin 'C'

I have a dataset which mixes use of unicode characters \u0421, 'С' and \u0043, 'C'. Is there some sort of unicode comparison which considers those two characters the same? So far I've tried several ICU collations, including the Russian one.
5
votes
1 answer

Questions about accent insensitivity in SQL Server (Latin1_General_CI_AS)

All our databases were installed using the default collation (Latin1_General_CI_AS). We plan to change the collation to allow clients to search the database with accent insensitivity. Questions: What are the negatives (if any) of having an accent…
Brett Postin
  • 10,721
  • 9
  • 58
  • 93
4
votes
1 answer

Java remove punctuation on a String (also ’ “ ” and all of these) maintaining accents characters

I need to remove punctuation reading on a file, maintaining accents character I tried this code but don't work how I would. Expectation: input=> ’'qwe..,rty ‘èeéò’“ ”o" "à output=> qwertyèeéòoà Effective result: input=> ’'qwe..,rty ‘èeéò’“ ”o"…
1
2 3 4