[ https://issues.apache.org/jira/browse/CRUNCH-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Josh Wills updated CRUNCH-489: ------------------------------ Attachment: CRUNCH-489c.patch Latest cut at this, which I'm happier with-- I changed the create() interface to be one of: PCollection<T> create(Iterable<T>, PType<T>) PCollection<T> create(Iterable<T>, PType<T>, CreateOptions) PTable<K, V> create(Iterable<Pair<K, V>>, PTableType<K, V>) PTable<K, V> create(Iterable<Pair<K, V>>, PTableType<K, V>, CreateOptions) ...where CreateOptions can hold a name for the new PCollection and a parallelism level to use (which we may ignore, depending on the Pipeline type.) I think that to make this work, I need to add a read() method that can take in not only a Source, but also a name that we want to give the resulting PCollection, which seems like a generally useful thing. Thoughts [~gabriel.reid] ? (Oh yes, and I need to add a Scala interface for [~davw].) > Add methods to create PCollections from Java Iterable to Pipeline interface > --------------------------------------------------------------------------- > > Key: CRUNCH-489 > URL: https://issues.apache.org/jira/browse/CRUNCH-489 > Project: Crunch > Issue Type: Bug > Reporter: Josh Wills > Attachments: CRUNCH-489.patch, CRUNCH-489b.patch, CRUNCH-489c.patch > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)