
Spark with Hive: dropping a table and recreating one with the same name fails because the table's files still exist in the local spark-warehouse directory


I'm new to Spark. I learned that Spark can use Hive's metastore, so I tried to set that up: I copied hive-site.xml, core-site.xml, and hdfs-site.xml into Spark's conf directory, and put the mysql-connector JAR into Spark's jars directory. After that I could see the existing Hive databases and tables through Spark SQL, and I could query the data in Hive. Then I created a database, created a table, and loaded some data.

When I look at HDFS, there is no trace of the new database, table, or data, but the spark-warehouse directory under SPARK_HOME contains them. My first question is: why is the data stored locally instead of in HDFS, even though I have configured Hive? Second, when I drop the table and then recreate one with the same name, Spark complains that the table's files already exist locally, so the drop operation apparently does not delete the local data. I'll add that after the drop, SHOW TABLES really does not list the dropped table anymore.
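For context, the metastore settings in the hive-site.xml I copied into Spark's conf directory look roughly like this (the host and warehouse path match the values used in my code below; treat the exact file contents as an assumption):

    <configuration>
      <!-- Remote Hive metastore service -->
      <property>
        <name>hive.metastore.uris</name>
        <value>thrift://node1:9083</value>
      </property>
      <!-- Warehouse root on HDFS used by Hive -->
      <property>
        <name>hive.metastore.warehouse.dir</name>
        <value>/user/hive/warehouse</value>
      </property>
    </configuration>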

Here is the relevant snippet of my code:

    from pyspark.sql import SparkSession

    _SPARK_HOST = "local[3]"
    _APP_NAME = "test"

    # Note: 9870 is normally the NameNode web UI port in Hadoop 3;
    # hdfs:// URIs usually point at the RPC port (often 8020).
    spark = SparkSession.builder \
        .master(_SPARK_HOST) \
        .appName(_APP_NAME) \
        .config("spark.sql.shuffle.partitions", "4") \
        .config("spark.sql.warehouse.dir", "hdfs://node1:9870/user/hive/warehouse") \
        .config("hive.metastore.uris", "thrift://node1:9083") \
        .enableHiveSupport() \
        .getOrCreate()
    spark.sparkContext.setLogLevel("WARN")

    spark.sql("show databases").show()
    spark.sql("use sparkhive").show()
    spark.sql("show tables").show()

    # getOrCreate() returns the session created above, so the configs passed
    # here (including the different warehouse dir) are silently ignored.
    spark = SparkSession.builder.master(_SPARK_HOST).appName(_APP_NAME) \
        .config("spark.sql.shuffle.partitions", "4") \
        .config("spark.sql.warehouse.dir", "hdfs://192.168.150.102:9870/user/hive/warehouse") \
        .config("hive.metastore.uris", "thrift://192.168.150.102:9083") \
        .enableHiveSupport().getOrCreate()

    spark.sql("LOAD DATA LOCAL INPATH '/export/pyworkspace/pyspark_sparksql_chapter3/data/hive/student.csv' INTO TABLE person")
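To see which warehouse location the session actually resolves, and where the table's files really live, I run this quick check (a sketch; sparkhive.person is the table from the snippet above):

    # Effective warehouse root the session resolved at startup
    print(spark.conf.get("spark.sql.warehouse.dir"))
    # The "Location" row shows where the table's files actually live
    spark.sql("DESCRIBE FORMATTED sparkhive.person").show(50, truncate=False)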

The error message is:

SparkRuntimeException: [LOCATION_ALREADY_EXISTS] Cannot name the managed table as `spark_catalog`.`sparkhive`.`person`, as its associated location 'file:/home/spark-3.5.1-bin-hadoop3/spark-warehouse/sparkhive.db/person' already exists. Please pick a different table name, or remove the existing location first.
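As a temporary workaround I can remove the orphaned directory by hand before recreating the table (a sketch; the path is copied verbatim from the error above):

    import shutil

    # Leftover location of the dropped managed table, taken from the error message
    leftover = "/home/spark-3.5.1-bin-hadoop3/spark-warehouse/sparkhive.db/person"
    shutil.rmtree(leftover, ignore_errors=True)
    # After this, recreating sparkhive.person no longer hits LOCATION_ALREADY_EXISTS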

I want to address this issue properly rather than cleaning up files by hand every time. How can I do that?

