I might need to chunk the vector and serialize the chunks in parallel to get acceptable performance on an absurdly large vector like this one: 100,000 integers, each of which has to be converted to bytes. That feels like cheating, though, since it's literally just throwing hardware at the problem instead of trying to find a better algorithm
hmm
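For reference, here's a rough sketch of what the chunk-and-parallelize idea could look like next to a plain single-pass version. Everything here is an assumption for illustration: the elements are taken to be `i32`, the byte format is taken to be little-endian with no length prefix, and the names `serialize_serial` and `serialize_chunked` plus the `chunk_len` parameter are made up. It only uses the standard library (`std::thread::scope`, `slice::chunks`, `i32::to_le_bytes`).

```rust
use std::thread;

/// Single-threaded baseline (hypothetical name): each i32 becomes
/// 4 little-endian bytes, appended to one pre-sized buffer.
fn serialize_serial(values: &[i32]) -> Vec<u8> {
    let mut out = Vec::with_capacity(values.len() * 4);
    for v in values {
        out.extend_from_slice(&v.to_le_bytes());
    }
    out
}

/// Chunked version (hypothetical name): split the slice into pieces of
/// `chunk_len` elements, serialize each piece on its own scoped thread,
/// then concatenate the per-chunk buffers in their original order.
fn serialize_chunked(values: &[i32], chunk_len: usize) -> Vec<u8> {
    assert!(chunk_len > 0, "chunk_len must be non-zero");
    let parts: Vec<Vec<u8>> = thread::scope(|s| {
        // Spawn one thread per chunk; scoped threads may borrow `values`.
        let handles: Vec<_> = values
            .chunks(chunk_len)
            .map(|chunk| s.spawn(move || serialize_serial(chunk)))
            .collect();
        // Join in spawn order so the output byte order matches the input.
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    });
    parts.concat()
}
```

With 100,000 values, `serialize_chunked(&values, 25_000)` would spawn four threads. It's worth timing both versions before committing to the parallel one: the per-element work here is tiny, so the cost of spawning threads and concatenating the buffers may well eat whatever the parallelism gains.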