- Apr 21, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
-
Aaron Henning authored
-
Aaron Henning authored
Add requirements.txt Update README.md Add new global var for feature name file Add code for generating html view of feature names and examples Update database code to include text and token Update .gitignore file Move some methods around
-
- Apr 18, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
Update gitignore file Move activation db code to own file Add code for actually naming a feature Update README.md
-
- Apr 16, 2024
-
-
Aaron Henning authored
Improve SAE stuff by making sure it is on cuda Improve getting activations for features (setting limits) Add first test of stored activations from database Add more global variable (I love those) for improving naming conventions Move models to folder to keep everything crispy clean
-
- Apr 14, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
-
Aaron Henning authored
Add database class to store results
-
Aaron Henning authored
-
Aaron Henning authored
Improve autoencoder training Move constants and SparseAutoencoder class to own file Move hyperparameters to own file Update .gitignore
-
Aaron Henning authored
-
- Apr 10, 2024
-
-
Aaron Henning authored
Add visualization of sae feature activations for a few sample data points. Does not work properly yet
-
- Apr 09, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
Add DataLoader class to load activations
-
- Apr 07, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
Improve calculating activations Scale down model and therefore compute and disk requirements Reduce reserved memory for slurm task, as it was not needed
-
- Apr 05, 2024
-
-
Aaron Henning authored
-
Aaron Henning authored
Add calculate_activations.py Update .gitignore Implement retrieving activations from model for data
-
Aaron Henning authored
Add .gitignore Add run shell files for slurm Implement loading and querying Phi2 Implement downloading and preprocessing data
-