I was thinking about adding to the specification how the current date/time
transforms work with source columns that are longs. I don't think it helps
much to introduce a new transform, although that would make the
interpretation of the source data more obvious to users applying the
transform.

On Wed, Sep 11, 2024 at 10:26 PM Micah Kornfield <emkornfi...@gmail.com>
wrote:

> Maybe we could update the time-based partition functions to be applied to
>> a long column directly. It would treat that column like a timestamp in
>> milliseconds. Would that work? I need to think more about the implications
>> of doing that, but I don't think that we currently have an issue with
>> extending the supported source column types like that.
>
>
> By update do you mean adding new functions like
> HOUR_FROM_UNIX_EPOCH_MILLIS or overloading the definition to accept Long
> values and treat them as milliseconds. The former seems more reasonable to
> me. The latter I think has many of the same draw-backs raised on the other
> thread.  IMO, both aren't super pleasing from a long term maintainability
> perspective.
>
> Cheers,
> Micah
>
> On Tue, Sep 10, 2024 at 8:50 AM rdb...@gmail.com <rdb...@gmail.com> wrote:
>
>> Maybe we could update the time-based partition functions to be applied to
>> a long column directly. It would treat that column like a timestamp in
>> milliseconds. Would that work? I need to think more about the implications
>> of doing that, but I don't think that we currently have an issue with
>> extending the supported source column types like that.
>>
>> On Mon, Sep 9, 2024 at 9:05 PM Manu Zhang <owenzhang1...@gmail.com>
>> wrote:
>>
>>> Hi all,
>>>
>>> I'd like to bump this thread since we don't want to allow long to
>>> timestamp promotion in V3
>>> <https://lists.apache.org/thread/79y866zdhs2fmyv0nsfq3xvdsmqh7h8c>.
>>> What other options do we have?
>>>
>>> Thanks,
>>> Manu
>>>
>>> On Fri, Apr 5, 2024 at 12:09 AM Jean-Baptiste Onofré <j...@nanthrax.net>
>>> wrote:
>>>
>>>> Ah yes, milestone is fine. Thanks !
>>>>
>>>> All good.
>>>>
>>>> Regards
>>>> JB
>>>>
>>>> On Thu, Apr 4, 2024 at 5:12 PM Eduard Tudenhoefner <edu...@tabular.io>
>>>> wrote:
>>>> >
>>>> > There is the V3 Spec milestone where it's tracked (amongst other
>>>> things).
>>>> >
>>>> > On Thu, Apr 4, 2024 at 9:44 AM Jean-Baptiste Onofré <j...@nanthrax.net>
>>>> wrote:
>>>> >>
>>>> >> Hi Eduard,
>>>> >>
>>>> >> Thanks for the update ! It makes sense to me.
>>>> >>
>>>> >> Maybe a GH label with spec or v3_spec would help to see what is
>>>> planned for v3 ?
>>>> >>
>>>> >> Regards
>>>> >> JB
>>>> >>
>>>> >> On Thu, Apr 4, 2024 at 9:36 AM Eduard Tudenhoefner <
>>>> edu...@tabular.io> wrote:
>>>> >> >
>>>> >> > Type promotion from Long to Timestamp is on the roadmap for the V3
>>>> Spec, so that would be the preferred way.
>>>> >> >
>>>> >> > On Wed, Apr 3, 2024 at 10:38 AM Jean-Baptiste Onofré <
>>>> j...@nanthrax.net> wrote:
>>>> >> >>
>>>> >> >> Hi Manu
>>>> >> >>
>>>> >> >> TIMESTAMP_LONG type promotion could be the easiest way, it would
>>>> work
>>>> >> >> with the existing transform.
>>>> >> >>
>>>> >> >> Would it work for you?
>>>> >> >>
>>>> >> >> Regards
>>>> >> >> JB
>>>> >> >>
>>>> >> >> On Wed, Apr 3, 2024 at 5:56 AM Manu Zhang <
>>>> owenzhang1...@gmail.com> wrote:
>>>> >> >> >
>>>> >> >> > Hi all,
>>>> >> >> >
>>>> >> >> > We have source data with a timestamp field in LONG type to land
>>>> in an Iceberg table. We want to partition the table with the timestamp
>>>> field such that we can query recent data more efficiently. However, LONG is
>>>> not supported as the source type of time-based transform (hour, day, etc)
>>>> >> >> >
>>>> >> >> > I find the previous discussion
>>>> https://github.com/apache/iceberg/issues/417 and Ryan suggested two
>>>> solutions
>>>> >> >> >
>>>> >> >> > 1. type promotion from LONG to TIMESTAMP
>>>> >> >> > 2. custom transform
>>>> >> >> >
>>>> >> >> > As I understand it, neither solution has already been
>>>> implemented yet. Is there any progress in either direction? Which solution
>>>> does the community prefer? Any other suggestions are also appreciated.
>>>> >> >> >
>>>> >> >> > Thanks,
>>>> >> >> > Manu
>>>>
>>>

Reply via email to