GitHub/Anki - OBJNULLs Forgejo: Beyond coding. We Forge.

mirror of https://github.com/ankitects/anki.git synced 2025-12-31 15:52:58 -05:00

Author	SHA1	Message	Date
Damien Elmes	a98126dab4	on upgrade make sure we set left	2011-09-23 10:31:31 +09:00
Damien Elmes	e7f416406d	refactor learning Rather than showing the user how many cards are in the learning queue, we want to be able to show them the number of reps they have to do to clear the queue, so they can better estimate the required time. Before we were counting up with the grade column, but this means we can't quickly sum up the number of reps left. So we invert it, and count down instead. I also dropped the 'first time bonus' for now. If there's enough demand for it, it can be added back by using the flags column, instead of a dedicated cycles column.	2011-09-23 10:29:49 +09:00
Damien Elmes	2b34d8a948	more group/sched refactoring - keep track of rep/time counts per group, instead of just at the top level - sort by due after retrieving learn cards - ensure activeGroups is sorted alphabetically - ensure new cards come in alphabetical group order - ensure queues are refilled when empty	2011-09-23 08:19:22 +09:00
Damien Elmes	024c42fef8	group scheduling refactor see the following for background discussion: http://groups.google.com/group/ankisrs-users/browse_thread/thread/4db5e82f7dff74fb - change sched index to the more efficient gid, queue, due - drop the dynamic index support. as there's no no q/a cache anymore, it's cheap enough to hit the cards table directly, and we can't use the index in its new form. - drop order by clauses (see todo) - ensure there's always an active group. if users want to study all groups at once, they need to create a top level group. we do this because otherwise the 'top level group' that's active when everything is selected is not clear. to do: - new cards will appear in gid order, but the gid numbers don't reflect alphabetical sorting. we need to change the scheduling code so that it steps through each group in turn - likewise for the learn queue	2011-09-22 11:54:01 +09:00
Damien Elmes	dac46752ed	drop the count_answered option	2011-09-18 09:42:04 +09:00
Damien Elmes	3e5041f337	reduce default cache size Use a more conservative 40MB for systems with a smaller amount of memory. Ideally we should bump this up if we detect the running system has a decent amount of memory.	2011-09-18 05:07:30 +09:00
Damien Elmes	2e628996a3	add flags back to cards, and fix syncing	2011-09-18 04:37:48 +09:00
Damien Elmes	f32787c18e	add guid and flags to facts Syncing and shared decks have conflicting priorities: - For syncing, we need to ensure that the deck remains in a consistent state. In the past, Anki allowed deletions to be overriden by a more recently modified object, but this could lead to a broken deck in certain circumstances. For example, if a user deletes a fact (and its cards) on one side, but does something to bump a card's mod time on another side, then when syncing the card would be brought back to life without its fact. Short of complex code to check all the relations, we're limited to two options: forcing a full sync when things are deleted, or ensuring objects can't come back to life. - When facts are shared between people, we need a way to identify if two facts arose from the same source. We can't compare based on content, as the content may have changed partially or completely. And we can't use the timestamp ids because of the above restriction on bringing objects back to life. If we did that, people could download a shared deck, decide they don't want it, and delete it. When they later decide to add it again, it wouldn't be possible: either nothing would be imported because of the old graves, or the ids would have to be rewritten. If we do the latter, the facts are no longer associated with each other, and we lose the ability to update the deck. So we need to give facts two IDs: one used as the primary key and for syncing, and another 'global id' for importing/sharing. I used a 64 bit random number, because a) it's what Anki's used in the past, so by reusing the old IDs we don't break existing associations on upgrade, and b) it's a decent compromise between the possibility of conflicts and performance. Also re-added a flags column to the facts. The 'data' column is intended to store JSON in the future for extra features without changing the schema, but that's slow for simple state checks. Flags will be used as a bitmask.	2011-09-18 04:31:53 +09:00
Damien Elmes	f3965f4c09	when suspending leeches, make sure we don't put the card back in the queue	2011-09-17 21:42:41 +09:00
Damien Elmes	4aedfd868b	count down again by default Upwards counts were a nice idea in theory, but when using them briefly in practice I quickly realized they're confusing. I'll probably pull the other option in the future.	2011-09-17 21:36:00 +09:00
Damien Elmes	64d13c2cbc	add a quick unit test to make sure groupCounts() works with changes	2011-09-15 01:42:48 +09:00
Damien Elmes	ee767ff132	refactor to allow group deletions without schema mod because group deletions are likely to be a semi-common operation (esp. for new users trying out shared material), deleting groups will no longer cause a full sync. in order to avoid syncing issues, we now allow cards/facts/etc to point to an invalid group, and in that case, we just treat them like they're in the default group	2011-09-15 01:37:30 +09:00
Damien Elmes	fa1b223363	use ms resolution for deck mod	2011-09-14 05:09:42 +09:00
Damien Elmes	bc9f6e6a24	add USNs Decks now have an "update sequence number". All objects also have a USN, which is set to the deck USN each time they are modified. When syncing, each side sends any objects with a USN >= clientUSN. When objects are copied via sync, they have their USNs bumped to the current serverUSN. After a sync, the USN on both sides is set to serverUSN + 1. This solves the failing three way test, ensures we receive all changes regardless of clock drift, and as the revlog also has a USN now, ensures that old revlog entries are imported properly too. Objects retain a separate modification time, which is used for conflict resolution, deck subscriptions/importing, and info for the user. Note that if the clock is too far off, it will still cause confusion for users, as the due counts may be different depending on the time. For this reason it's probably a good idea to keep a limit on how far the clock can deviate. We still keep track of the last sync time, but only so we can determine if the schema has changed since the last sync. The media code needs to be updated to use USNs too.	2011-09-13 21:10:21 +09:00
Damien Elmes	b391202e47	add failing three-way test While thinking about media syncing I realized the current sync algorithm is flawed in certain cases. It might be time to think about using a USN instead, as that should also hopefully solve the skewed clock problem properly.	2011-09-13 05:12:30 +09:00
Damien Elmes	87bfb38e2b	move db - if we store it inside the media folder, we inadvertently bump the folder mod time every time sqlite creates a journal file - close/reopen the media db as the deck is closed/opened	2011-09-12 05:03:31 +09:00
Damien Elmes	c59dd854fb	add change detection I removed the media database in an earlier commit, but it's now necessary again as I decided to add native media syncing to AnkiWeb. This time, the DB is stored in the media folder rather than with the deck. This means we avoid sending it in a full sync, and makes deck backups faster. The DB is a cache of file modtimes and checksums. When findChanges() is called, the code checks to see which files were added, changed or deleted since the last time, and updates the log of changes. Because the scanning step and log retrieval is separate, it's possible to do the scanning in the background if the need arises. If the DB is deleted by the user, Anki will forget any deletions, and add all the files back to the DB the next time it's accessed. File changes are recorded as a delete + add. media.addFile() could be optimized in the future to log media added manually by the user, allowing us to skip the full directory scan in cases where the only changes were manually added media.	2011-09-12 03:11:06 +09:00
Damien Elmes	7e1df75cc2	simplify media.py - drop mediaPrefix & the mediaURL-based downloading - always create the media folder - remove move() in preparation for a single collection approach	2011-09-11 00:25:22 +09:00
Damien Elmes	cf8288a5e0	todo	2011-09-09 23:06:12 +09:00
Damien Elmes	14b642f633	mod schema when rev order updated	2011-09-09 22:38:00 +09:00
Damien Elmes	9aad5c1166	more unit tests, fix bugs - make sure gconf has an id - merge deck conf	2011-09-09 22:34:50 +09:00
Damien Elmes	6cfe112f91	card tests	2011-09-09 21:24:52 +09:00
Damien Elmes	f15cb23c41	skip the 600 second pad during testing	2011-09-09 21:17:42 +09:00
Damien Elmes	85a2bb6193	revlog timestamp is ms based; should fetch facts/cards by mod not id	2011-09-09 21:11:11 +09:00
Damien Elmes	af2b2373b9	dueCounts()	2011-09-09 18:40:46 +09:00
Damien Elmes	362ae3eee2	initial work on sync refactor Ported the sync code to the latest libanki structure. Key points: No summary: The old style got each side to fetch ids+mod times and required the client to diff them and then request or bundle up the appropriate objects. Instead, we now get each side to send all changed objects, and it's the responsibility of the other side to decide what needs to be merged and what needs to be discarded. This allows us to skip a separate summary step, which saves scanning tables twice, and allows us to reduce server requests from 4 to 3. Schema changes: Certain operations that are difficult to merge (such as changing the number of fields in a model, or deleting models or groups) result in a full sync. The user is warned about it in the GUI before such schema-changing operations execute. Sync size: For now, we don't try to deal with large incremental syncs. Because the cards, facts and revlog can be large in memory (hundreds of megabytes in some cases), they would have to be chunked for the benefit of devices with a low amount of memory. Currently findChanges() uses the full fact/card objects which we're planning to send to the server. It could be rewritten to fetch a summary (just the id, mod & rep columns) which would save some memory, and then compare against blocks of a few hundred remote objects at a time. However, it's a bit more complicated than that: - If the local summary is huge it could exceed memory limits. Without a local summary we'd have to query the db for each record, which could be a lot slower. - We currently accumulate a list of remote records we need to add locally. This list also has the potential to get too big. We would need to periodically commit the changes as we accumulate them. - Merging a large amount of changes is also potentially slow on mobile devices. Given the fact that certain schema-changing operations require a full sync anyway, I think it's probably best to concentrate on a chunked full sync for now instead, as provided the user syncs periodically it should not be easy to hit the full sync limits except after bulk editing operations. Chunked partial syncing should be possible to add in the future without any changes to the deck format. Still to do: - deck conf merging - full syncing - new http proxy	2011-09-08 12:50:42 +09:00
Damien Elmes	7034c1ed29	drop syncName, fix leech unit test	2011-09-07 20:11:26 +09:00
Damien Elmes	d34465c1e6	halve the leech threshold, as it only applies to rev->relearn failures now	2011-09-07 20:02:47 +09:00
Damien Elmes	751cb7df67	add a new default for counts() As per the forum thread, the current due counts are really demotivating when there's a backlog of cards. In attempt to solve this, I'm trying out a new behaviour as the default: instead of reporting all the due cards including the backlog, the status bar will show an increasing count of cards studied that day. Theoretically this should allow users to focus on what they've done rather than what they have to do. The old behaviour is still there as an option.	2011-09-07 19:11:37 +09:00
Damien Elmes	8997d8cc8b	track all reps & time on a per-day basis We did away with the stats table because it's impossible to merge it, so the revlog is canonical now. But we also want a cheap way to display to the user how much time or how many cards they've done over the day, even if their study is split into multiple sessions. We were already storing the new cards of a day in the top level groups, so we just expand that out to log the other info too. In the event of a user studying in two places on the same day without syncing, the counts will not be accurate as they can't be merged without consulting the revlog, which we want to avoid for performance reasons. But the graphs and stats do not use the groups for reporting, so the inaccurate counts are only temporary. Might need to mention this in an FAQ. Also, since groups are cheap to fetch now, cards now automatically limit timeTaken() to the group limit, instead of relying on the calling code to do so.	2011-09-07 18:48:29 +09:00
Damien Elmes	28d045feef	rewrite groupCounts() Instead of collecting the exact number of cards, we just record whether a group has any reviews or new cards. By not needing to calculate the exact numbers, it runs a lot faster than before. Also, changed the group code to ensure parents are automatically created when a group is added.	2011-09-07 03:02:07 +09:00
Damien Elmes	de8a5b69ed	top level groups As discussed on the forums, moving to a single collection requires moving some deck-level configuration into groups so users can have different settings like new cards/day for each top level item. Also: - store id in groups - add mod time to gconf updates - move the limiting code that's not specific to scheduling into groups.py - store the current model id per top level group	2011-09-07 01:31:46 +09:00
Damien Elmes	5179d82f7f	drop the sources table; we'll store it in deck conf	2011-09-06 23:24:51 +09:00
Damien Elmes	9130e09b3e	rename some columns for consistency - revlog's 'time' is now 'id', like the other tables - 'taken' is now 'time' - also dropped the eta code	2011-09-06 21:33:19 +09:00
Damien Elmes	b3937a3280	make sure currentModelId is set and css regenerated on upgrade	2011-08-28 15:14:04 +09:00
Damien Elmes	c0e992618c	add groups.all()	2011-08-28 14:58:22 +09:00
Damien Elmes	cca48a0ced	thinko	2011-08-28 14:40:52 +09:00
Damien Elmes	6a00419ebc	merge deck.qconf and deck.conf	2011-08-28 14:17:33 +09:00
Damien Elmes	a30836445e	modelCache is no longer used	2011-08-28 13:52:32 +09:00
Damien Elmes	a9b4285959	rename a few methods for consistency	2011-08-28 13:48:17 +09:00
Damien Elmes	be5c5a2018	move tags into deck; code into separate file - moved tags into json like previous changes, and dropped the unnecessary id - added tags.py for a tag manager - moved the tag utilities from utils into tags.py	2011-08-28 13:44:29 +09:00
Damien Elmes	d20984a686	fix upgrade	2011-08-28 00:20:24 +09:00
Damien Elmes	78600e8ed6	move group code into a registry like models	2011-08-27 23:45:55 +09:00
Damien Elmes	d3a3edb707	move models into the deck table Like the previous change, models have been moved from a separate DB table to an entry in the deck. We need them for many operations including reviewing, and it's easier to keep them in memory than half on disk with a cache that gets cleared every time we .reset(). This means they are easily serialized as well - previously they were part Python and part JSON, which made access confusing. Because the data is all pulled from JSON now, the instance methods have been moved to the model registry. Eg: model.addField(...) -> deck.models.addField(model, ...). - IDs are now timestamped as with groups et al. - The data field for plugins was also removed. Config info can be added to deck.conf; larger data should be stored externally. - Upgrading needs to be updated for the new model structure. - HexifyID() now accepts strings as well, as our IDs get converted to strings in the serialization process.	2011-08-27 22:27:09 +09:00
Damien Elmes	7afe6a9a7d	convert groups to json; use timestamp ids for all but default Rather than use a combination of id lookups on the groups table and a group configuration cache in the scheduler, I've moved the groups and group config into json objects on the deck table. This results in a net saving of code and saves one or more DB lookups on each card answer, in exchange for a small increase in deck load/save work. I did a quick survey of AnkiWeb, and the vast majority of decks use less than 100 tags, and it's safe to assume groups will follow a similar pattern. All groups and group configs except the default one will use integer timestamps now, to simplify merging when syncing and importing. defaultGroup() has been removed in favour of keeping the models up to date (not yet done).	2011-08-27 17:13:04 +09:00
Damien Elmes	47be8b0546	use the timestamps instead of forcing id on fact/card creation - we ditch nextCid/nextFid as we don't need incrementing ids anymore - we add nextPos so we can maintain a user-friendly position number	2011-08-27 00:36:39 +09:00
Damien Elmes	f7b89c9fa1	ensure unique id on per-object add, too	2011-08-26 22:51:08 +09:00
Damien Elmes	ebac628187	ensure duplicate model creation times are accounted for	2011-08-26 22:33:24 +09:00
Damien Elmes	5868ff52b9	handle duplicate fact creation times	2011-08-26 22:28:35 +09:00
Damien Elmes	f6189f453a	update card ids - upgrade facts before cards so we don't have to rewrite the cards table twice - ensure duplicate card creation times are accounted for	2011-08-26 22:25:20 +09:00

... 4 5 6 7 8 ...

1856 commits