Is it better to sort before a Join?

Mahulima Contributor

Does sorting the IDs used in the join condition improve performance? I thought adding a Sort step before the Join would speed things up, but the run actually takes 15+ seconds longer with the Sort step in place. Is there another way to improve performance, or is sorting before a join simply not recommended?

Comments

  • Josh Boxer
    Josh Boxer Administrator

    The Join step sorts by key internally, which is likely why it is faster not to add a separate Sort step first.

  • Mahulima
    Mahulima Contributor

    Oh okay, thanks for clarifying.
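
As a rough illustration of the reply above, here is a minimal Python sketch of a sort-merge join. It is not the product's actual Join implementation; the function name `sort_merge_join` and the sample rows are hypothetical. It only shows that a join which already sorts both inputs by key returns the same result whether or not the inputs were sorted beforehand, so a separate sort is paid on top of the internal one.

```python
def sort_merge_join(left, right):
    """Inner join of two lists of (key, value) rows on the key.

    Hypothetical sketch: both inputs are sorted by key inside the join,
    so callers do not need to sort them first.
    """
    left = sorted(left, key=lambda row: row[0])
    right = sorted(right, key=lambda row: row[0])

    joined = []
    i = j = 0
    while i < len(left) and j < len(right):
        left_key, right_key = left[i][0], right[j][0]
        if left_key < right_key:
            i += 1
        elif left_key > right_key:
            j += 1
        else:
            # Find the run of right rows that share this key.
            j_end = j
            while j_end < len(right) and right[j_end][0] == left_key:
                j_end += 1
            # Pair every left row with this key against that run.
            while i < len(left) and left[i][0] == left_key:
                for right_row in right[j:j_end]:
                    joined.append((left_key, left[i][1], right_row[1]))
                i += 1
            j = j_end
    return joined


# Hypothetical sample data, deliberately unsorted.
orders = [(3, "c"), (1, "a"), (2, "b")]
customers = [(2, "x"), (3, "y"), (3, "z")]

unsorted_result = sort_merge_join(orders, customers)
presorted_result = sort_merge_join(sorted(orders), sorted(customers))
```

Both calls produce the same rows. The second one just does an extra sorting pass before the join repeats the work internally.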