```python
import numpy as np

# Build a dict mapping each key to its vector from the doc2vec file
d2v_rdd = spark.sparkContext.textFile("")
for row in d2v_rdd.collect():
    row_elements = row.split("\t")
    vector_dict[row_elements[0]] = np.array(row_elements[1:])

# Getting the dim features from the products file
products_rdd = spark.sparkContext.textFile("")
for row in products_rdd.collect():
    row_elements = row.split("\t")
```

The dataset has 431,907 rows.
I have the above lines of code implemented in three different forms:

- plain Python, reading the file with `with open("")`
- reading it into a Spark DataFrame with `spark.read.csv`
- the RDD approach shown above
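For reference, a minimal sketch of the plain-Python variant I am comparing against (the function name `load_vectors` is a placeholder of mine, and the real file path is omitted above; the parsing mirrors the RDD code):

```python
import numpy as np

def load_vectors(path):
    """Read a tab-separated file of `key\tv1\tv2...` lines into a dict of numpy vectors."""
    vector_dict = {}
    with open(path) as f:
        for line in f:
            parts = line.rstrip("\n").split("\t")
            # first column is the key, the remaining columns are the vector components
            vector_dict[parts[0]] = np.array(parts[1:], dtype=float)
    return vector_dict
```
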
I expected the Spark DataFrame version to be the fastest, but it turns out the most efficient method is the plain-Python context manager (`with open("")`).

Why might this be happening?