- id,matricula,disciplina
- 1,123,44
- 2,234,44
- ## generating an RDD from the CSV file (no header)
- data = sc.textFile("your File Path/matriculas.csv")  # forward slash avoids the invalid "\m" escape
- data = data.map(lambda x: x.split(","))
- ## ALTERNATIVE (when the file has a header)
- data = sc.textFile('path_to_data')
- header = data.first() #extract header
- data = data.filter(lambda row: row != header) #filter out header (Python lambda, not Scala's =>)
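The header-skip and split steps above can be sanity-checked on a small sample without a Spark context. The following is only a plain-Python sketch of the same logic; `parse_csv_lines` and `sample` are hypothetical names introduced for illustration and are not part of PySpark.

```python
# Plain-Python sketch (no Spark needed) mirroring the RDD steps above.
# `parse_csv_lines` is a hypothetical helper, not a PySpark API.

def parse_csv_lines(lines, has_header=True):
    """Split each CSV line on commas, optionally dropping the header row."""
    lines = list(lines)
    if has_header and lines:
        header = lines[0]  # counterpart of data.first()
        # counterpart of data.filter(lambda row: row != header)
        lines = [row for row in lines if row != header]
    # counterpart of data.map(lambda x: x.split(","))
    return [row.split(",") for row in lines]

# Sample rows taken from the CSV shown at the top of the paste.
sample = ["id,matricula,disciplina", "1,123,44", "2,234,44"]
rows = parse_csv_lines(sample)
print(rows)
```

As in the RDD version, the filter drops every line equal to the header, so a data row that happens to repeat the header text would also be removed.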