The Comprehensive Manual Coding Validation Tool is a Streamlit-based web application designed to assist researchers in validating automated coding results through manual review and comparison. This tool provides an intuitive interface for reviewing coded data, adjusting labels, and analyzing the accuracy of automated coding processes.
You can try out the tool without any installation by visiting my hosted Streamlit app:
https://manual-coding-validation-tool.streamlit.app
- Custom Dataset Upload: Users can upload their own CSV datasets or use a default dataset for illustration.
- Flexible Column Selection: Select columns for text content, predicted labels, and additional information.
- Multiple Sampling Methods: Choose between Binary Classification, Multi-class Random Sampling, or Stratified Sampling.
- Interactive Coding Interface: Easily navigate through samples and adjust labels.
- Real-time Statistics: View accuracy, precision, recall, and F1 score updates as you code.
- Confusion Matrix Visualization: Understand classification performance with an interactive confusion matrix.
- Progress Tracking: Monitor your coding progress with a dynamic progress bar.
- Export Options: Save your validated samples in CSV, Excel, or JSON formats.
- Dark Mode: Toggle between light and dark themes for comfortable viewing.
-
Clone this repository:
git clone https://github.com/yourusername/manual-coding-validation-tool.git
-
Navigate to the project directory:
cd manual-coding-validation-tool
-
Install the required dependencies:
pip install -r requirements.txt
-
Run the Streamlit app:
streamlit run app.py
-
Open your web browser and go to
http://localhost:8501
. -
Follow the on-screen instructions to upload your dataset or use the default one.
-
Select the appropriate columns for text content and predicted labels.
-
Choose your sampling method and generate a sample.
-
Start coding! Review each item, adjust labels as needed, and submit.
-
Monitor your progress and view real-time statistics.
-
When finished, export your validated sample in your preferred format.
Contributions to improve the Comprehensive Manual Coding Validation Tool are welcome! Please follow these steps to contribute:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Make your changes and commit them with a clear commit message.
- Push your changes to your fork.
- Submit a pull request with a description of your changes.
This project is licensed under the MIT License - see the LICENSE file for details.
I am working on improving and expanding the capabilities of this tool. Currently planned improvements:
- Collaborative Coding: Enable multiple users to work on the same dataset simultaneously.
- Advanced Visualization: Implement more complex visualizations for in-depth analysis of coding patterns.
- Multi-language Support: Add support for multiple languages in the user interface.
- Mobile Responsiveness: Optimize the interface for mobile devices to enable coding on-the-go.
- Performance Optimization: Improve loading and processing times for larger datasets.
WeI welcome suggestions and contributions for these and other improvements!
For questions, issues, or suggestions, please open an issue on the GitHub repository.