Hi Philippe, I would like to draw your attention to this great library that saved my day in the past when parsing phone numbers in Spark:
https://github.com/google/libphonenumber If you combine it with Bjørn's suggestions you will have a good start on your linkage task. Best regards, Anastasios Zouzias On Sat, Apr 1, 2023 at 8:31 PM Philippe de Rochambeau <phi...@free.fr> wrote: > Hello, > I’m looking for an efficient way in Spark to search for a series of > telephone numbers, contained in a CSV file, in a data set column. > > In pseudo code, > > for tel in [tel1, tel2, …. tel40,000] > search for tel in dataset using .like(« %tel% ») > end for > > I’m using the like function because the telephone numbers in the data set > main contain prefixes, such as « + « ; e.g., « +3312224444 ». > > Any suggestions would be welcome. > > Many thanks. > > Philippe > > > > > > --------------------------------------------------------------------- > To unsubscribe e-mail: user-unsubscr...@spark.apache.org > > -- -- Anastasios Zouzias <a...@zurich.ibm.com>