Sunday, December 2, 2012

yogi_Remove All Diacritics From Letters Of Text In Column A


                                          Google Spreadsheet   Post  #905
Yogi Anand, D.Eng, P.E.      ANAND Enterprises LLC -- Rochester Hills MI     www.energyefficientbuild.com.    Dec 1, 2012
user Grengounours said:(http://productforums.google.com/forum/#!category-topic/docs/spreadsheets/GLMVebWitDA)
Removing all diacritics

Hi all,
I would like to know if there is a function or script that can remove diacritics in a cell.
Meaning transforming "Bogosavljević Bošković" in "Bogosavljevic Boskovic".
G.D.

----------------------------------------------------------------------------
I am not aware of any function that would remove the diacritics ... so
I used the SUBSTITUTE function to replace letters with diacritics to those without diacritics and the REGEXREPLACE function to delete other non-alaphabets for a bit more generalized situation ... as presented in the following

1 comment:

  1. This one is more complete:
    =ArrayFormula(regexreplace(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(substitute(C2,"Á","A"),"á","a"),"ó","o"),"č","c"),"ć","c"),"đ","d"),"š","s"),"ž","z"),"Č","C"),"Ć","C"),"Đ","D"),"Š","S"),"Ž","Z"),"[^a-zA-Z ]+",))
    it has all Serbian letters and a few hungarian ones.

    ReplyDelete