Github user jingyimei commented on the issue:
https://github.com/apache/madlib/pull/325
@iyerr3
> I'm getting confused here, so maybe an example would help.
>
> My understanding of @orhankislal 's comment was that say `page_rank` and
`bfs` both use the same table called "vertex" then we will have to drop this
table at the beginning of each of the test files. This may then collide with a
user-created "vertex" table - hence this commit.
> My suggestions to solve the problem:
>
> * Use a function-specific name (`"pagerank_vertex"`) instead of a generic
name with (`"vertex"`) no drop by in the beginning.
This seems doesn't solve the potential conflict with user created table.
Besides, in some IC/DC file we drop tables in between and create table with the
same name again. In this case the second drop/creation will fail if user
already created it.
> * Create a single `"vertex"` table (with no drop by statement) and use
that for all graph functions
This may cause some assertion based on returning number of records fail as
I mentioned in the previous comment, and also needs effort of audition and
combining test data.
> * Schema-qualify the data table with the install-check schema name.
>
(not sure if I totally understand this) install-check schema is based on
module, not algo, for example pagerank and hits and sssp use the same IC
schema, and there will be issue between running two sql files for
droping/creating same table.
> Please note I'm not suggesting pooling **all** install-check data into a
single file (at least not as a solution to this problem).
I understood your point. What I was trying to say was multiple algos in the
same module share the same data creation file.
---