[GitHub] madlib issue #325: Madpack/ic func schema

jingyimei Fri, 28 Sep 2018 18:26:52 -0700

Github user jingyimei commented on the issue:

    https://github.com/apache/madlib/pull/325
  
    @iyerr3 
    
    > I'm getting confused here, so maybe an example would help.
    > 
    > My understanding of @orhankislal 's comment was that say `page_rank` and 
`bfs` both use the same table called "vertex" then we will have to drop this 
table at the beginning of each of the test files. This may then collide with a 
user-created "vertex" table - hence this commit.
    
    > My suggestions to solve the problem:
    > 
    > * Use a function-specific name (`"pagerank_vertex"`) instead of a generic 
name with (`"vertex"`) no drop by in the beginning.
    
    This doesn't seem to solve the potential conflict with a user-created 
table. Besides, in some IC/DC files we drop tables partway through and create a 
table with the same name again. In that case the second drop/creation will fail 
if the user has already created a table with that name.
    > * Create a single `"vertex"` table (with no drop by statement) and use 
that for all graph functions
    
    This may cause some assertions based on the number of returned records to 
fail, as I mentioned in the previous comment, and it also requires effort to 
audit and combine the test data.
    > * Schema-qualify the data table with the install-check schema name.
    > 
    (Not sure if I totally understand this.) The install-check schema is based 
on the module, not the algo. For example, pagerank, hits, and sssp use the same 
IC schema, so there will be a conflict between two SQL files that drop/create 
the same table.
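
    A minimal sketch of that collision, in Python with SQLite standing in for 
the real database. The schema name `ic_graph` and the helper `run_test_file` 
are made up for illustration and are not MADlib code. It assumes two test files 
in one module each create a schema-qualified `vertex` table without dropping it 
first:

```python
import sqlite3

# SQLite is only a stand-in for PostgreSQL/Greenplum here; "ic_graph" is a
# hypothetical name for a module-level install-check schema.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS ic_graph")

# A table the user created before running install-check.
conn.execute("CREATE TABLE main.vertex (id INTEGER)")


def run_test_file(name):
    """Mimic one IC sql file that creates its data table with no drop."""
    try:
        conn.execute("CREATE TABLE ic_graph.vertex (id INTEGER)")
        return f"{name}: created ic_graph.vertex"
    except sqlite3.OperationalError as err:
        return f"{name}: failed ({err})"


# Both "files" belong to the same module, so they share one IC schema.
results = [run_test_file("pagerank"), run_test_file("hits")]
for line in results:
    print(line)
```

    In this sketch the first file succeeds because the schema-qualified name 
does not clash with the user's `vertex` table, while the second file fails 
because the table already exists in the shared schema.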
    
    > Please note I'm not suggesting pooling **all** install-check data into a 
single file (at least not as a solution to this problem).
    
    I understand your point. What I was trying to say was that multiple algos 
in the same module share the same data-creation file.

---
