Guide to Clojure REPL Driven Development with Emacs Doom
-
Updated
Apr 13, 2025 - HTML
Guide to Clojure REPL Driven Development with Emacs Doom
rddapp: Regression Discontinuity Design Application
A project on classification of GitHub readme sections using Machine Learning
Analysis of clinical trial data
This respository contains projects made for the Large Scale Data Analysis course at the AGH UST in 2024.
An assignment on preprocessing of text including tokenization, stop word removal
An assignment on preprocessing of text including tokenization, stop word removal, noise reduction, and stemming
A quiz on PySpark transformations and data analytics pipeline
Utilização de Joins no Databricks utilizando Spark Context
My solution to Introduction to Big Data with Apache Spark MOOC at Edx
This repository contains Databricks projects utilizing RDDs, DataFrames, and SQL to process and analyze various real-world datasets. Data cleaning and analysis have been performed using PySpark functions to handle challenges such as inconsistent formats, missing values, and complex data structures. The project ensures efficient data transformation
A midterm on breadth first search, map reduce, and PySpark transformations
Add a description, image, and links to the rdd topic page so that developers can more easily learn about it.
To associate your repository with the rdd topic, visit your repo's landing page and select "manage topics."